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I was just going to say, when I was interrupted, that one of the 
many ways of classifying minds is under the heads of arith- 
metical and algebraical intellects. All economical and practical 
wisdom is an extension of the following arithmetical formula: 
2 + 2 = 4. Every philosophical proposition has the more 
general character of the expression a + b = c. We are mere 
operatives, empirics, and egotists until we learn to think in letters 
instead of figures. 
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Foreword 


Modern or “‘abstract”’ algebra is widely recognized as an essential element 
of mathematical education. Moreover, it is generally agreed that the axioma- 
tic method provides the most elegant and efficient technique for its study. 
One must continually bear in mind, however, that the axiomatic method is an 
organizing principle and not the substance of the subject. A survey of algebraic 
structures is liable to promote the misconception that mathematics is the 
study of axiom systems of arbitrary design. It‘seems to me far more interesting 
and profitable in an introductory study of modern algebra to carry a few topics 
to asignificant depth. Furthermore I believe that the selection of topics should 
be firmly based on the historical development of the subject. 

This book deals with only three areas of abstract algebra: group theory, 
Galois theory, and classical ideal theory. In each case there is more depth 
and detail than is customary for a work of this type. Groups were the first 
algebraic structure characterized axiomatically. Furthermore the theory of 
groups is connected historically and mathematically to the Galois theory of 
equations, which is one of the roots of modern algebra. Galois theory itself 
gives complete answers to classical problems of geometric constructibility and 
solvability of equations in radicals. Classical ideal theory, which arose from 
the problems of unique factorization posed by Fermat’s last theorem, is a 
natural sequel to Galois theory and gives substance to the study of rings. All 
three topics converge in the fundamental theorem of algebraic number theory 
for Galois extensions of the rational field, the final result of the book. 

Emil Artin wrote: We all believe that mathematics is an art. The author of a 
book, the lecturer in a classroom tries to convey the structural beauty of mathe- 
matics to his readers, to his listeners. In this attempt he must always fail. Mathe- 
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matics is logical to be sure; each conclusion is drawn from previously derived 
statements. Yet the whole of it, the real piece of art, is not linear; worse than 
that its perception should be instantaneous. We all have experienced on some 
rare occasions the feeling of elation in realizing that we have enabled our list- 
eners to see ata moment’s glance the whole architecture and all its ramifications. 
How can this be achieved? Clinging stubbornly to the logical sequence inhibits 
visualization of the whole, and yet this logical structure must predominate or 
chaos would result.t 

A text must cling stubbornly to the logical sequence of the subject. A lec- 
turer may be peripatetic, frequently with engaging results, but an author must 
tread a straight and narrow path. However, though written sequentially, this 
book need not be read that way. The material is broken into short articles, 
numbered consecutively throughout. These can be omitted, modified, post- 
poned until needed, or given for outside reading. Most articles have exercises, 
a very few of which are used later in proofs. What can be covered in an or- 
dinary course and for what students the text is suitable are questions left to 
the instructor, who is the best judge of local conditions. It is helpful, but 
certainly not essential, for the reader to know a little linear algebra for the 
later chapters—in particular Cramer’s rule. (Vector spaces, bases, and dimen- 
sion are presented in articles 90-95.) 

Finally, I must gratefully acknowledge the assistance of Mrs. Theodore 
Weller and Miss Elizabeth Reynolds, who typed the manuscript, and the 
help of Messrs. George Blundall and John Ewing, who gave their time and 
patience to proofing it. 


Providence, Rhode Island 
January 1, 1970 


{ Bulletin of the American Mathematical Society, 1953, p. 474. Reprinted by permission 
of the publisher. 
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Introduction 


Classical algebra was the art of resolving equations. Modern algebra, the 
subject of this book, appears to be a different science entirely, hardly con- 
cerned with equations at all. Yet the study of abstract structure which charac- 
terizes modern algebra developed quite naturally out of the systematic 
investigation of equations of higher degree. What is more, the modern 
abstraction is needed to bring the classical theory of equations to a final 
perfect form. 

The main part of this text presents the elements of abstract algebra in a 
concise, systematic, and deductive framework. Here we shall trace in a 
leisurely, historical, and heuristic fashion the genesis of modern algebra from 
its classical origins. 


The word algebra comes from an Arabic word meaning “‘ reduction” or 
“restoration.” It first appeared in the title of a book by Muhammad ibn 
Musa al-Khwarizmi about the year 825 a.p. The renown of this work, which 
gave complete rules for solving quadratic equations, led to use of the word 
algebra for the whole science of equations. Even the author’s name lives on 
in the word algorithm (a rule for reckoning) derived from it. Up to this point 
the theory of equations had been a collection of isolated cases and special 
methods. The work of al-Khwarizmi was the first attempt to give it form and 
unity. 

The next major advance came in 1545 with the publication of Artis Magnae 
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sive de Regulis Algebraicis by Hieronymo Cardano (1501-1576). Cardano’s 
book, usually called the Ars Magna, or ‘“‘ The Grand Art,” gave the complete 
solution of equations of the third and fourth degree. Exactly how much 
credit for these discoveries is due to Cardano himself we cannot be certain. 
The solution of the quartic is due to Ludovico Ferrari (1522-1565), Cardano’s 
student, and the solution of the cubic was based in part upen earlier work of 
Scipione del Ferro (1465 ?-1526). The claim of Niccolo Fontana (1500?- 
1557), better known as Tartaglia (“‘the stammerer’’), that he gave Cardano 
the cubic under a pledge of secrecy, further complicates the issue. The bitter 
feud between Cardano and Tartaglia obscured the true primacy of del Ferro. 


A solution of the cubic equation leading to Cardano’s formula is quite 
simple to give and motivates what follows. The method we shall use is due 
to Hudde, about 1650. Before we start, however, it is necessary to recall that 
every complex number has precisely three cube roots. For example, the com- 
plex number! =1 +i has the three cube roots, | (itself), # = —4 +4 V—3, 
and w? = —4— 4\/—3. In general, if z is any one of the cube roots of a com- 
plex number w, then the other two are wz and w?z. 

For simplicity we shall consider only a special form of the cubic equation, 


xe+qx—r=0. (1) 


(However, the general cubic equation may always be reduced to one of this 
form without difficulty.) First we substitute u+v for x to obtain a new 
equation, 


(u? + 3u2v + 3uv? + v*?) +q(u t+ v) —r=0, (2) 
which we rewrite as 
ue +v> + (Bu + qu +v)—r=0. (3) 


Since we have substituted two variables, uw and v, in place of the one variable 
x, we are free to require that 3uv + q = 0, or in other words, that v = —q/3u. 
We use this to eliminate v from (3), and after simplification we obtain, 


q? 
6 2 

ur — —-—=0. 4 
ru" — aa (4) 
This last equation is called the resolvent equation of the cubic (1). We may 
view it as a quadratic equation in wu? and solve it by the usual method to 
obtain 


ik r 
F=-4+ [+ —. 5 
u 4 (5) 


Of course a complete solution of the two equations embodied in (5) gives six 
values of w—three cube roots for each choice of sign. These six values of uw are 
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the roots of the sixth-degree resolvent (4). We observe however that if wis a 
cube root of (r/2) + (7/4) 4 -- (43/27), then v = —q/3u is a cube root of 
(r/2) — /(r?/4) + (q°/27). Consequently the six roots of (4) may be con- 
veniently designated as u, wu, w?u and v, wv, wv, where uv = —q/3. Thus the 
three roots of the original equation are 


a =ut+, Ot. = wu + wv, a; = wu + wn, (6) 
where 
2 3 
Ao oi ii @ -q 
u-=>+ /—+ = and v=—. 
2 44 (27 3u 


In other words, the roots of the original cubic equation (1) are given by the 
formula of Cardano, 
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in which the cube roots are varied so that their product is always —q/3. 


For our purposes we do not need to understand fully this complete solution 
of the cubic equation—only the general pattern ts of interest here. The im- 
portant fact is that the roots of the cubic equation can be expressed in terms 
of the roots of a resolvent equation which we know how to solve. The same 
fact is true of the general equation of the fourth degree. 

For a long time mathematicians tried to find a solution of the general 
quintic, or fifth-degree, equation without success. No method was found to 
carry them beyond the writings of Cardano on the cubic and quartic. Con- 
sequently they turned their attention to other aspects of the theory of equa- 
tions, proving theorems about the distribution of roots and finding methods 
of approximating roots. In short, the theory of equations became analytic. 

One result of this approach was the discovery of the fundamental theorem 
of algebra by D’Alembert in 1746. The fundamental theorem states that every 
algebraic equation of degree n has n roots. It implies, for example, that the 
equation x” — 1 = 0 has n roots—the so-called nth roots of unity—from which 
it follows that every complex number has precisely ” nth roots. D’Alembert’s 
proof of the fundamental theorem was incorrect (Gauss gave the first correct 
proof in 1799) but this was not recognized for many years, during which the 
theorem was popularly known as ‘“‘ D’Alembert’s theorem.” 

D’Alembert’s discovery made it clear that the question confronting alge- 
braists was not the existence of solutions of the general quintic equation, but 
whether or not the roots of such an equation could be expressed in terms of 
its coefficients by means of formulas like those of Cardano, involving only 
the extraction of roots and the rational operations of addition, subtraction, 
multiplication, and division. 
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In a new attempt to resolve this question Joseph Louis Lagrange (1736- 
1813) undertook a complete restudy of all the known methods of solving 
cubic and quartic equations, the results of which he published in 1770 under 
the title Réflexions sur la résolution algébrique des equations. Lagrange 
observed that the roots of the resolvent equation of the cubic (4) can be ex- 
pressed in terms of the roots a,, a, a3 of the original equation (1) in a com- 
pletely symmetric fashion. Specifically, 


v =4(a, + wa, + w7a5), u=4(a, + wa; + wa), 
ov=4(a,+a,+wa,), wu=H(a, + wa, + wa), (7) 


wv =4H(a,+wa3;+w%a,), wu =4H(a3 + wa, + wa). 


All these expressions may be obtained from any one of them by permuting 
the occurrences of «,, @,, a3 in all six possible ways. 

Lagrange’s observation was important for several reasons. We obtained 
the resolvent of the cubic by making the substitution x =u + v. Although 
this works quite nicely, there is no particular rhyme nor reason to it—it is 
definitely ad hoc. However Lagrange’s observation shows how we might have 
constructed the resolvent on general principles and suggests a method for 
constructing resolvents of equations of higher degrees. Furthermore it shows 
that the original equation is solvable in radicals if and only if the resolvent 
equation is. 

To be explicit let us consider a quartic equation, 


x* — px? + qx? —rx+s =0, (8) 


and suppose that the roots are the unknown complex numbers @,, a, #3, %4.- 
Without giving all the details we shall indicate how to construct the resolvent 
equation. First we recall that the fourth roots of unity are the complex 
numbers 1, i, i7, i>, where i=. /—1 and i? =—1, i? = —i. Then the roots of 
the resolvent are the twenty-four complex numbers 


Us jyy = 4(a; + dey + io, + 17%), (9) 


where the indices i, j, k, / are the numbers 1, 2, 3, 4 arranged in some order. 
Therefore the resolvent equation is the product of the twenty-four distinct 
factors (x — u;;,;). That is, we may write the resolvent equation in the form 


(x) = [] @ — viju) = 0. (10) 


ijkl 


Thus the resolvent of the quartic has degree 24, and it would seem hopeless 
to solve. It turns out, however, that every exponent of x in (x) is divisible by 
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4,and consequently (x) =0 may be viewed as a sixth-degree equation in x*. 
What is more, this sixth-degree equation can be reduced to the product of two 
cubic equations (in a way we cannot make explicit here). Since cubics can be 
solved, a solution of the quartic can be obtained by a specific formula in 
radicals. (Such a formula is so unwieldy that it is more useful and understand- 
able simply to describe the process for obtaining solutions.) + 

For quintic, or fifth-degree, equations Lagrange’s theory yields a resolvent 
equation of degree 120, which is a 24th-degree equation in x°. Lagrange was 
convinced that his approach, which revealed the similarities in the resolution 
of cubics and quartics, represented the true metaphysics of the theory of 
equations. The difficulty of the computations prevented Lagrange from 
testing whether his techniques could produce a formula for resolving the 
quintic in radicals. Moreover, with his new insights, Lagrange could foresee 
the point at which the process might break down, and he gave equal weight to 
the impossibility of such a formula. 

A short time afterward, Paolo Ruffini (1765-1822) published a proof of the 
unsolvability of quintic equations in radicals. Ruffini’s argument, given in his 
two-volume Teoria generale delle equazioni of 1799, was correct in essence, 
but was not, in actual fact, a proof. A complete and correct proof was given 
by Niels Henrik Abel (1802-1829) in 1826 in a small book published at his 
own expense. The brilliant work of Abel closed the door on a problem which 
had excited and frustrated the best mathematical minds for almost three 
centuries. 

There remained one final step. Some equations of higher degree are clearly 
solvable in radicals even though they cannot be factored. Abel’s theorem 
raised the question: which equations are solvable in radicals and which are 
not? The genius Evariste Galois (1811-1832) gave a complete answer to this 
question in 1832. Galois associated to each algebraic equation a system of 
permutations of its roots, which he called a group. He was able to show 
equivalence of the solvability of an equation in radicals, with a property of its 
group. Thus he made important discoveries in the theory of groups as well as 
the theory of equations. Unfortunately Galois’ brief and tragic life ended in a 
foolish duel before his work was understood. His theory perfected the ideas 
of Lagrange, Ruffini, and Abel and remains one of the stunning achievements 
of modern mathematical thought. 

At this point we can only leave as a mystery the beautiful relation Galois 
discovered between the theory of equations and the theory of groups—a 
mystery resolved by the deep study of both theories undertaken in the text. 

We can, however, gain some insight into modern abstraction by a short and 
informal discussion of groups. To take an example near at hand, we shall 
consider the group of permutations of the roots x,, x, «3 of the cubic equa- 
tion—which happens to be the Galois group of this equation in general. This 
group consists of six operations, A, B, C, D, E, and /, specified as follows: 
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leaves a, fixed and interchanges the roots a, and «, wherever they occur. 
leaves «, fixed and interchanges «, and a3. 

interchanges a, and a,, leaving a; fixed. 

replaces «, by a, at each occurrence, a, by a3, and a; by a). 

replaces a, by «3, a; by a,, anda, by q. 

is the identity operation, which makes no change at all. 


~moagyr 


For example, the result of applying the operation A to v, as expressed in 
(7), is u. We indicate this by writing 


A(v) = u. 


Similarly, the result of applying the operation E to v is wv, or in other words, 
E(v) = wv. Of course, by definition, /(v) =v. It is easy to verify that by 
applying the six operations A, B, C, D, E, and J to v, we obtain all six of the 
expressions in (7) for the roots of the resolvent equation. 

These’ operations have the property that if any two of them are applied suc- 
cessively, the result is the same as if one of the others had been applied once. 
For example, suppose we apply the operation A to v, obtaining u, and then 
apply the operation D to u, obtaining wu. The result is the same as if we had 
applied the operation C directly to v. We can express this in symbols by 


D(A(v)) = C(v). 


In fact this remains true no matter what we put in place of v. That is, the 
result of first applying the operation A and then applying D is the same as 
applying the operation C. We sum this up in the simple equation: DA = C. 
There are many other relations of this sort among these operations. For 
example, we may compute the result of the composite operation EB on any 
function f(a,, %,, %3) as follows: 


B(f (4, Xo, a3) = f(a; »%2, ty), 
EB(f (a1, &2, %3)) = E(f (a3, %2, %)) =f (a2, a, &3) = C(f(%y, &2, &3)). 


Thus EB = C. The thirty-six relations of this type can be given conveniently 
in a table. We put the result of the composite operation X Y in the X row and 
the Y column. 

We observe now that composition of the operations A, B, C, D, E, and J 
has the following properties. 

(1) For any three operations X, Y, and Z, we have 


X(YZ) =(XY)zZ! 


In other words, the result of first performing the operation YZ and then the 
operation X is the same as the result of first performing the operation Z and 
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Table 1 

Ae Bog tee Dit Ey 
A Cor | 
ie Eh Le Dd Coote tbood 
Ctl) a Be eel AbD HC 
Loe Ay Bo EuedoeaD 
ae a ke A a De te 
ota le OC DE 


then the operation X Y. For example, from Table | we see that AB = D and 
BC = D, and therefore 


A(BC) = AD = B= DC =(AB)C. 


Thus we have verified the equation above for the special case where X = A, 
Y = B, and Z=C. This property of the composition of the operations is 
called associativity. To verify associativity completely from Table 1 we would 
have to make 216 checks like the one above. 

(2) For any operation X we have 


ALT= X=1X. 


In other words, the composition of any operation X with the identity operation 
I always gives X again. This property is easily checked by examining the last 
row and the last column of Table 1. 

(3) For any operation X there is precisely one operation Y such that 


XY=1= YX. 


In other words, whatever the operation X does to the roots a,, a,, a3, Y does 
just the opposite. We call Y the inverse of X and denote it by X ~'. It is easy 
to see from Table | that 


A.’ =A, B= 8 Cc +t=c¢, D'=E, E?=D, i Vey 


Whenever we have a set of operations and a rule for composing them that 
satisfies these three properties, we say that the operations form a group. 

Once we know that a set of operations with a particular rule for composing 
them is a group, we can analyze properties of these operations and their 
composition without regard to the manner in which they are defined or 
the context in which they arose. This simplifies the situation by eliminating 
irrelevant details, and gives the work generality. 

To clarify this process of abstraction, let us consider another group of 
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operations defined in a completely different way. Again we shall have six 
operations, but this time we shall call them by the Greek letters x, B, y, 6, €, 
and 1. These will operate on the rational numbers (except 0 and 1) by the 
following rules: 


a(x) ==, Ax) = ULL 
B(x) =1-x, ax) = ——_, 
ms 


(x) = —, (x) = x, 


where x is any rational number except 0 or 1. We may compose these opera- 
tions and the result will always be one of the other operations. For example, 
we have that da = y, since 


see 3(-) x = ale) 


a) P= ee 


Again, we may make a table of all thirty-six compositions of these six opera- 
tions. 


Table 2 

| OG 6 a bide seamen 
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It is immediately apparent that Table 2 has a strong resemblance to Table 1. 
For example, every occurrence of A in the first table corresponds to an occur- 
rence of « inthe second. Similarly the letters B and B occur in the same posi- 
tions in each table. In fact Table 1 may be transformed into Table 2 by making 
the substitutions: 


A-oa Bop, C>y, D-6, Ee, I-51. 


In other words, these two groups have the same structure as groups even 
though the individual operations are defined in quite different ways. To put 
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it another way, all the facts which depend solely upon the way operations are 
composed will be the same for both groups. In such a case two groups are 
said to be isomorphic. Group theory studies the properties of groups which 
remain unchanged in passing from one group to another isomorphic with it. 

Group theory was called the ‘‘ theory of substitutions ”’ until 1854 when the 
English mathematician Arthur Cayley (1821-1895) introduced the concept of 
abstract group. The convenience and power of the abstract approach to group 
theory was evident by the end of the nineteenth century. Subsequent abstrac- 
tions, such as field and ring, have also proved to be powerful concepts. The 
success of abstract thinking in algebra has been so enormous that the terms 
modern algebra and abstract algebra are synonymous. 

Abstraction is simply the process of separating form from content. We 
abstract whenever we pass from a particular instance to the general case. Even 
the simplest mathematics, ordinary arithmetic, is an abstraction from physical 
reality. In modern mathematics we abstract from previous mathematical 
experience and reach a new and higher plane of abstraction. Indeed, each 
mathematical generation abstracts from the work of preceding ones, continu- 
ally distilling and concentrating the essence of old thought into new and more 
perfect forms. The rewards are great. Not only does abstraction greatly en- 
hance our understanding, it also dramatically increases the applications of 
mathematics to practical life. Even such an apparently recondite subject as 
group theory has applications in crystallography and quantum mechanics. 
Over centuries modern algebra has grown into a large body of abstract 
knowledge worthy of study both for its intrinsic fascination and extrinsic 
application. 
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Chapter I 


Set theory is the proper framework for abstract mathematical thinking. All 
of the abstract entities we study in this book can be viewed as sets with 
specified additional structure. Set theory itself may be developed axiomati- 
cally, but the goal of this chapter is simply to provide sufficient familiarity with 
the notation and terminology of set theory to enable us to state definitions 
and theorems of abstract algebra in set-theoretic language. It is convenient to 
add some properties of the natural numbers to this informal study of set 
theory. 

It is well known that an informal point of view in the theory of sets leads 
to contradictions. These difficulties all arise in operations with very large 
sets. We shall never need to deal with any sets large enough to cause trouble in 
this way, and, consequently, we may put aside all such worries. 


2 JI Set Theory 


The Notation and 
Terminology of Set Theory 


1. A set is any aggregation of objects, called elements of the set. Usually 
the elements of a set are mathematical quantities of a uniform character. 
For example, we shall have frequent occasion to consider the set of 
integers {..., —2, —1,0, 1, 2,...}, which is customarily denoted Z (for the 
German “‘ Zahlen,”’ which means “ numbers’’). We shall use also the set Q of 
rational numbers—numbers which are the quotient of two integers, such as 
7/3, —4/S, 2. 

To give an example of another type, we let K denote the set of coordinate 
points (x, y) in the xy-coordinate plane such that x? + y? = 1. Then K is the 
circle of unit radius with the origin as center. 


2. To indicate that a particular quantity x is an element of the set S, we 
write x € S, and to indicate that it is not, we write x ¢ S. Thus —2 e€ Z, but 
1/2 ¢Z; and 1/2 € Q, but ./2¢Q 

A setis completely determined by its elements. Two sets are equal if and only 
if they have precisely the same elements. In other words, S = T if and only if 
x eS implies x € T and x €7 implies x € S. 

It will be convenient to write x, y, ze S forxe S, yeS,andzeS. 


3. Aset Sis a subset of a set Tif every element of S is an element of 7, or in 
other words, if x € S implies x € 7. To indicate that S is a subset of T we write 
ScT.If Sc Tand Tc S, then x € S implies x € T and x € T implies x € S, 
so that S =T. 

The empty set 0 is the set with no elements whatever. The empty set is a 
subset of every set T. If S is a subset of JT and neither S = 0 nor S = T, then 
S is called a proper subset of T. 


4. Frequently a set is formed by taking for its elements all objects which have 
a specific property. We shall denote the set of all x with the property P by 
{x | P(x)}. Thus, 


Z = {x|x is an integer}. 


To indicate that a set is formed by selecting from a given set S those 
elements with property P, we write {x e S| P(x)}. It isclearthat {x e S| P(x)} 
is always a subset of S. For example, the set of even integers, 


2Z = {x €Z|x =2y, y EZ}, 


is a subset of Z. 
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5. The intersection of two sets S and T is the set S ~ T of elements common 
to both. In other words, 


Sera we Seands eT }: 


The intersection S 4 T is a subset of both S and 7. The sets S and 7 are 
said to be disjoint if SA T= 9. 

We note the following properties of intersection: 

(a) AN(BNC)=(ANB)NC, 

(b) ANB=BQ 4A, 

(c) ANA=A and And=9, 

(d) ANB=A_ ifandonlyif ACB. 


Let S,, S,,..., S, be sets. Then we shall write 


as an abbreviation for 


Sraysger=* OS, =i xix eS, foreach? =1y 2, . 2,7}. 


6. The union of two sets S and T is the set S U T of elements in S or T or in 
both S and 7. In other words, 


SU T = {x|x eS and/or x ET}. 


S and 7 are both subsets of S u T. 
The following properties of union are analogous to those of intersection: 


(a) AU(BUC)=H(AV B)UC, 

(b) AUB=BULUA, 

Ci AwAH=A and AvUd= A, 
(qd) AVUB=B8B ifandonlyif AcB. 


Let S,, S,,..., S, be sets. Then we shall write |_)7_, S; as an abbreviation 
for 
ay Yes US, = {xx € S; for at least oe 7= 1, 2, ..., 7}. 


7. Intersection and union are related by the following distributive laws: 


@, Av(BnC)=(Av B)n(AvC), 
(b) AN(BUC)=(ANB)U(ANC). 


8. The difference of two sets S and Tis the set S — T of elements of S which 
are not elements of 7. In other words, 
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S—T= {xeS|x¢T}. 


S — T is always a subset of S. 
The difference of sets has the following properties: 


(a) A—B=9Q ifandonlyif AcB, 

(b) A—B=A ifandonlyif An B=9Q, 

(c) A—B=A-—C ifandonlyif ANB=ANC, 
(d) A-@=A and A-A=Q, 

(:) A-(BNC)=(A-B)U(A-C), 

(f) A-(BUC)=(A-B)N(A-C). 


8a. The symmetric difference of two sets A and B is the set 
A*B=(A—B)uU(B- A). 


Show that A * B=(A U B)—-(A OB). Show that A * B=Q if and only if 
A = B. Prove that the symmetric difference is an associative operation on sets, 
that is to say, A * (B* C) =(A * B) * C for any three sets A, B, and C. 


8B. Ifevery set ina discussion is a subset of a given set Q, then we call Y the 
universe (of that discussion). The complement of a subset A of Q is the set 
A* =%— A. Demonstrate the following properties of complements for 
subsets of QU: 


(A*)* =A, (AU B)* = A* 0 B*, and (AN B)* = A* vu Be, 
Show that A* « B* = 4 x B. 


9. The cartesian product of two sets S and Tis the set S x T of ordered pairs 
(x, y) with x e Sand y ET. Two elements (x, y) and (x’, y’) of the cartesian 
product S x T are equal if and only if x = x’ and y = y’. Note that the car- 
tesian product T x S is not the same as the cartesian product S x T. (Why?) 

As an example we may consider the coordinate plane as the set R x R 
where R denotes the set of real numbers. Each point of the coordinate plane 
is specified by an ordered pair (x, y) of real numbers, and each such ordered 
pair specifies a point in the plane. Note that (x, y) = (y, x) if and only if x = y. 

Let [a, b] = {x €R|a <x <b} denote the closed interval from a to b. 
Then the cartesian product [I, 3] x [0, 1] may be represented in the coordin- 
ate plane by Figure 1. 

Let S,, $2, -.-, 5, be sets. We denne 


n 
X Spa SPR Sore Xe 


i] 
= 


to be the set of ordered n-tuples (x,, x2, -.., X,) with x; € S;. We shall call 
x; the i-th coordinate of (x,, X2,---, Xq)- 
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[1,3]x[0, 1] 


Figure 1 


9a. Let A and C be subsets of S, and let Band D be subsets of 7. Prove the 
following statements about subsets of S x T: 


(Ax Byin(C x D) =(A.aC) x (BoD), 
(AUC) x (Bu D)=(A x B) U(A x D)U(C x B) U (Cx D), 
(A x B) —(C x D) =(A x (B— D)) U((A—-C) x B). 


9B. Let R, S, and 7 be sets. Are the sets (R x S) x Tand R x (S x T) the 
same ? 


Mappings 


10. Mapping is an abstraction of the concept of function. While a function 
assigns to a given number another number, a mapping assigns to a given 
element of one set an element of another. In other words, a mapping ffrom a 
set XY toa set Y is arule which assigns to each element x € X anelement ye Y. 
To remove the ambiguity residing in the word rule, it is necessary to recast this 
definition in the context of set theory. 

A mapping f with domain X and range Y isa subset of X¥ x Y such that for 
each element x € X there is precisely one element y € Y for which (x, y) ef. 
We write f: X > Y to indicate that fis a mapping with domain X and range Y. 
If f: X + Y and (x, y) ef, we usually write fx for y. It is now fashionable to 
write f: x y in place of (x, y) ef. 

Since mappings are defined as sets (of a special type), it is clear what 
equality of mappings should mean. Two mappings /, g: X > Y are equal if 
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they are equal as subsets of X x Y. It follows that f= g if and only if fx = gx 
for all xe X. 
The identity mapping of a set X is 


ly = (x, ype Xx X|x=y}. 


10a. An arbitrary subset of the cartesian product X x FY is called a relation 
with domain X and range Y. For any relation Rc X x Y and any element 
x € X, we set 


Rx = {ye Y|(x, y) ER}. 


A mapping 1s a special type of relation. Specifically, arelation Rc X x Yis 
a mapping from X to Y if and only if for each x € X, Rx consists of precisely 
one element of Y. Let R denote the set of all real numbers. Which of the 
following relations are mappings from R to R? 


R, = {(, y) eR x R[x? + y? = 1}, 
R,= {(x, y) ER xR [oy = 

R3 = {((x, y) eR «x Ri x* +9? = 1}, 
Ra = {(x%, y) ER x Rix? + y* = 1}, 
R, = {(x, y) ER x R|./x+ Jy =1}. 


11. Let f: X¥ - Y be a mapping. For any subset A of X, the image of A by fis 
the set 


fA=iye Y|y =f eer}: 


The set fX is also denoted Im fand called simply the image of f. The mapping 
Sis called onto ifIm f= Y.IfIm/is a proper subset of Y, then/ is called into. 


12. Let f: ¥ > Y bea mapping. For any subset Bof Y, the inverse image of B 
by f is the set 


f 'B= {xe X|fx e B}. 


Note that f~'Y = X¥. The mapping f/f is said to be one to one if for each 
ye Y,f7~*{y} has at most one element. (f~'{y} =9 if y¢é Imf.) 


12a. Let f: X > Y bea mapping, let A and B be subsets of X, and let C and 
D be subsets of Y. Give a proof or counterexample for each of the fol- 
lowing assertions: 


f(A oB)=fAvfBe fis) =a ayaa 
fVAD B)=fAnfB, of (Can D)=f "Care? 
f(A—B)=fA—-fB,  f-_ (C= D)=f *C=7 
f-'(fA) = A, fF *C)=C. 


Which of the false statements become true when / is one to one? 
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128. For what integral values of » is the mapping f/:R—-R given by 
f(0) = 0 and fx = x" for x #0, a one-to-one mapping? 


13. A mapping /: X — Y is a one-to-one correspondence if fis one to one and 
onto. This is equivalent to saying that for each y € Y, f~'{y} has precisely one 
element. 

If f: X > Y is a one-to-one correspondence, we can define an inverse map- 
ping f-': Y > X which is also a one-to-one correspondence. In fact, we just 
set 


ft ={Q, x) Y x X|(x, y) ef}. 
It follows that y = fx if and only if x =f~'y. Note that (f~')"' =f. 


13a. Let N, = {1,2,...,k}. Define a one-to-one correspondence from 
N, x N, to N,;- 


13B. If S and 7 denote sets, define a one-to-one correspondence from S x T 
ror gx 'S. 


13y. If R, S, and T denote sets, define a one-to-one correspondence from 
moxsy)x TtoR x (S xT). 


14. The power set of aset X is the set 2*, whose elements are the subsets of X. 
In other words, 


2 = {5 Se XQ. 
Theorem. There is no one-to-one correspondence f. X — 2* for any set X. 


Proof. Suppose there were a set X with a one-to-one correspondence 
ff; X + 2*. For each x € X, fx is a subset of X and either x € fx or x ¢ fx. Let 


R= {xe X|x¢7x}. 
Since a one-to-one correspondence is onto, R=/fa for some ae X. Then 


aeéfa=R implies a¢ fa, while a ¢ fa implies ae R = fa. This is a contra- 
diction. 


14a. Construct a one-to-one correspondence 
Os oe 
where A and B are disjoint sets. 


14p. Let N,={1, 2,..., 4}. Construct a one-to-one correspondence between 
the séts 2 and N,. = {1, 2, 3,..., 2°}. 
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15. Let N denote the set of natural numbers {1, 2, 3, ...} and let N, denote 
theusubset af 1, 2,°... pk Wy 

A set S is finite if it is in one-to-one correspondence with one of the sets 
N,, or if it is empty. If there is a one-to-one correspondence from a set S to 
N,, then the number of elements in Sis k. A set which is not finite is called 
infinite. We cannot properly speak of the number of elements in an infinite set. 
However, we shall say that two infinite sets have the same cardinality if 
there is a one-to-one correspondence between them. 

A set S is countable if it is in one-to-one correspondence with N, the set 
of natural numbers. For example, a one-to-one correspondence /: N — Z is 
given by 


fk = (= 1)*[k/2], 


where [x] denotes the greatest integer not exceeding x. Consequently, the set 
Z of all integers is countable. A one-to-one correspondence ¢: N x NN is 
given by 


o(m,n) =43(m+n—2)mt+n—1) +2. 


Consequently, N x N is a countable set. 

Not every infinite set is countable: there can be no one-to-one correspond- 
ence between N and its power set 2; hence, the set 2 is uncountable. (We 
shall apply the words countable and uncountable to infinite sets only.) 


15a. For any finite set S, let %S denote the number of elements of S. Prove 
that for any finite set S, % (25) = 2°). 


15B. Prove that for any two finite sets S and 7, 
*K(SUT)+ *(SAT)= *S 4+ XT. 
15y. Prove that for any two finite sets S and 7, 
*(S x T) = O&S)OKT). 
155. Prove that the cartesian product of two countable sets is countable. 
15e. Prove that the set Q of all rational numbers is countable. 


15¢. Let X~ denote the set of sequences of elements of X. Show that X © is 
uncountable if X has two or more elements. (“‘Sequence”’ here simply means 
an infinite string, x,, X., x3, ..-, of elements of YX.) 


15n. Let S be a set with a countable number of elements. Show that a subset 
of S is either finite or countable. 
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1586. Explain why a finite set cannot be in one-to-one correspondence with 
one of its proper subsets. (In some versions of set theory this is used as the 
defining property for finiteness.) 


16. Let f: X > Y and g: YZ be mappings. Their composite is the map- 
ping gf: X — Z determined by (gf)x = g( fx). We may define gf more formally 
by 


Of = {(x, 2) € X x Z| (fx, 2) €g}. 


If f: X > Y, g: YZ, and h: Z— W are mappings, then h(gf) = (hg) /; 
that is to say, composition of mappings is associative. To prove this, we merely 
need to observe that the two mappings A(g/) and (hg) f have the same value on 
each element x € X: 


(h(gf))x = h((gf)x) = h(g(fx)) = (ha)(fx) = (hg) f)x. 


If f: X > Y is a one-to-one correspondence with inverse f~': Y > X, then 
f-'f=l,y and ff~' =1,. Note that for any mapping f/: X¥ > Y, we always 
have fly = f= lyf 


Equivalence Relations 


17. An equivalence relation on a set X is a subset R of X x X such that: 


(a) (x,x) eR forall xe X, 
(b) (x, y) € R implies (y, x) € R, 
(c) (x,y) € R and (y, z) eR imply (x, z) € R. 


Frequently we prefer to write x = y (R) in place of (x, y) E R. 
If Ris an equivalence relation on X, then for x € X, the R-equivalence class 
of x is the set 


[x], = {ye X|x =y(R)} = {ye X](%, y) € R}. 


When only one equivalence relation is under consideration, we usually suppress 
the subscript R on [x],. 

For any equivalence relation R on X, [x]z = [Ya if x = y(R), and [x]p 0 
[yl], = if x #y(R). The quotient of X by R is the set X/R of equivalence 
classes [x]z, where x runs through the elements of X. [ Jg: X¥> X/R will 
denote the classifying map, defined by [ Jgx = [x]p. 
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17a. Let X bea set partitioned into disjoint subsets, X,, X,,..., X,. (Every 
element belongs to precisely one of the subsets.) Define an equivalence relation 
Ron X for which X/R = {X,, X2, ..., X,}. 


17B. Prove that the intersection of equivalence relations is again an equiva- 
lence relation. 


17y. Let R be an equivalence relation on X and S an equivalence relation on 
X/R. Find an equivalence relation T on X such that (X/R)/S is in one-to-one 
correspondence with X/T under the mapping [[x]g]s+> [x]r. 


18. Congruence of Integers. Let m be a natural number, and let 
R,, = {((a,b)€Z x Zla=b+km;k €Z}. 


R,, 1s an equivalence relation on the set Z of all integers and is called congru- 
ence modulo m. The number mm is called the modulus. We write a = b mod m to 
indicate that (a, b) € R,,; similarly, we write a#b mod m to indicate that 
(a, b) ¢ Ry. 

The equivalence class of a € Z will be denoted [a],,; that is to say, 


[a], = {x €Z|x =amod m} = {xEeZ|x=a+km;k €Z}. 


Every a € Z is congruent modulo m to one of the numbers 0, 1, ..., m — 1. In 
fact if r is the smallest nonnegative integer in [a],,, then 0<r<m and 
a=rmod m. It follows that the quotient set Z,, = Z/R,, is simply 


{OT [lie sey [m — Le 
18a. Prove thata =a’ mod mand b=D’' mod m imply that 
a+b=a'+b'modm and ab=a'b' modm. 


(This allows us to define sum and product on Z,, by the rules [a],, + [db], = 
[a + b}n and [4],,[b]m = [45}m -) 


18B. Let R,, denote congruence mod m onthe set of integers Z. What is the 
equivalence relation R,, © R,? 


18y. Let 
R= {(a, b) €Z x Zla? = b? mod 7}. 
Into how many equivalence classes does R partition Z? 
19. Frequently, we are given amapping/: X > Y and an equivalence relation 


R on the set X, and we want to define a mapping @: X¥/R-— Y such that 
o[ Jr =f. Clearly, this can be done, if at all, only by setting ¢[x]zp =/x. 
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When does this make sense? If x and y are two elements of X which are 
equivalent with respect to the relation R, then we have [x]z = [y]z; it will have 
to follow that ¢[x]z = ¢[y]r, or what is the same thing, fx = fy. We see now 
that the formula ¢[x]z = fx defines a mapping ¢: X/R-— Y if and only if for 
all (x, y) € R, we have fx = fy. When this condition holds, we say that the 
mapping ¢ is well defined. (The terminology is idiotic: @ is not defined at all 
unless f has the required property.) 

To give an example, suppose that we want to define a mapping from the 
set Z,, (defined in 18) to any set Y by means of a mapping /: Z— Y. To do 
this, we must check that for any integers x and y, the condition x = y mod m 
implies fx = fy. 
19a. When is the mapping ¢: Z,, > Z, given by ¢[x],, = [x], well defined ? 


198. Show that addition of elements of Z,, is well defined by the rule 


[a], + (5) = [a + b]n- 


Properties of the Natural Numbers 


20. Let N denote the set of natural numbers {1, 2,°3,...}. We shall take the 
following statement as an axiom: 


Every nonempty subset of N has a smallest element. 


This axiom has, as an immediate consequence, the principle of mathemat- 
ical induction: 


If S is a subset of N such that 1 € S and such that né S impliesn+ 16S, 
then S =N. 


In fact the hypotheses on S imply that the set N — S has no smallest element. 
20a. Prove the alternate form of the principle of mathematical induction: Jf 
Sis a subset of N such that N, < Sand such that N, < S implies N,4, < S, then 
S = N. (Recall that N, = {1, 2,..., k}.) 


20B. Prove by induction the formulas 


S i= nth + 1) and Si? = defn > Tan + 1). 
i=1 i=1 
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20y. Prove by induction the binomial theorem: 


(x + y)" => (7) 


n! 
k!(n—k)} 


n 


where ( % 


denotes the binomial coefficient 


21. The Division Theorem. For all natural numbers a,beéEN there exist 
unique nonnegative integers q and r such thata=qb+randr<b. 


Proof. Let N denote the set of nonnegative integers. It follows from 20 
that every nonempty subset of N has a smallest element. The set 


S={xeN|x=a—kb,keN} 


is not empty because aeéS. Let r be the smallest element of S. Clearly, 
r=a-—qb for some géEN and r<b (otherwise r—b eS, contradicting 
minimality of r). The uniqueness of r is apparent, and r=a—qb=a-—q'b 
implies g’ = qg, which shows uniqueness of q. 


21a. Let beN, b> 1. Show that every natural number can be represented 
uniquely in the form 


r, bk +r, tee tnd tr, 
where ro, 71, --<, & € (OSI ee — 


22. A number bEN divides a number aéN provided a =qb for some 
q EN. To indicate that b divides a, we write b|a, and to indicate that it does 
not, ba. Thus, 2|4, but 2,5. For any natural number 7 we always have 
1|n and n|n. If 1 and if 1 and 7 are the only natural numbers dividing n, 
then v is called a prime number. The first ten primes are 2, 3, 5, 7, 11, 13, 17, 
19, 23, 29. The number of prime numbers is infinite (22+). 


22a. Show that every natural number other than | is divisible by some prime. 


22B. Construct a natural number which is not divisible by any of the prime 
numbers in a given list of primes P;, P2>-+-+» Pk: 


22y. Prove that the number of primes is infinite. 


23. If a and b are natural numbers, then among all the natural numbers 
dividing both a and b there is a largest one, which we call the greatest common 
divisor and denote by (a, b). For example, (6, 8) = 2, (24, 30) = 6, (5, 7) = 1. If 
(a, b) = 1, then we say that q and Db are relatively prime or that a is prime to b. 
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Theorem. If a and b are natural numbers, then there exist integers u and v 
such that (a, b) = ua + vb. 


Proof. Let 
W= {xeN|x=ma+t+nb for m,neZ}. 


Since a,beN and a=1-a+0-b,b=0:a+1-b, the set QC is not empty 
and therefore has a smallest element d = ua + vb for some u, v € Z. We claim 
that d| x for all x € 2. Otherwise, for some x € 2 we have x = qd + r where 
0 <r <d. Since x = ma + nb for some m, n € Z, we have 


r= x —qd = (ma + nb) — q(ua + vb) = (m — qua + (n — qu)b. 


Consequently, r € 2 and r <d, which contradicts the choice of d as the 
smallest element of 2f. Thus, the claim is proved. It follows that d|aandd|b, 
and therefore 1 <d < (a, b). However, d = ua + vb, and as a result we must 
have (a, b)|d, and hence (a, b) < d. It follows that (a, b) = d = ua + vb. 


Corollary. If p is a prime number and p|ab, then p|a or p|b. 


Proof. Suppose p/a. Then(p, a) = 1 = ua + up for some u, v € Z. There- 
fore b = uab + vpb, and p|ab implies p|b. 


23a. Prove that deN is the greatest common divisor of a,b eéN if and 
only if 

(1) dla and ab, 

iwelia and c|b imply cld. 
23B. Prove that m = m' mod n implies (m, n) = (m’, n). 


23y. If a and b are natural numbers, then among all the natural numbers 
divisible by both a and 6 there is a smallest, which we call the least common 
multiple and denote by [a, b]. Show that a|c and b|c imply [a, 5] | c. Show also 
that (a, b)[a, b] = ab. 


236. Prove that a =bmodm and a=b mod a imply a = b mod [m, n]. 


23e. Let a,b,c eéN. Prove that 
[a, (b, c)] = ([a, 5], [a,c]) and (a, [b, c]) = [(a, 6), (@, c)). 


23¢. The Euclidean Algorithm. Givena, b €N, define a decreasing sequence 
of natural numbers, 


SE Sh, S48 S ryt, = 0 
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by the requirement thatr;_, =4,;r; + r;4, fori=0,1,...,”.(Letr_; =aso 
that a=q)b+,1r,.) Show that r, = (a,b). (This method of computing the 
greatest common divisor is found at the beginning of the seventh book of 
Euclid’s Elements.) 


23n. Prove that if p is prime and a, b € Z, 
(a + b)? =a? + b? mod p. 


230. Let a,, a,, ..., a,€Z, not all zero. Define the greatest common 
divisor (a,, @2,..., 4,) and prove the analogue of the theorem in 23. 


24. The Fundamental Theorem of Arithmetic. Every natural number greater 
than | can be expressed uniquely as a product of prime numbers. 


Proof. First we show that each n EN, n> 1, is divisible by some prime. 
Let S denote the set of natural numbers greater than | which are not divisible 
by any prime. If S is not empty, then S has a smallest element ™. Since m|m, 
we cannot have m prime. Therefore, m = ab where | < a < m. Consequently, 
a ¢ Sand there is a prime p which divides a. Then p|m also and m ¢ S, a con- 
tradiction. Therefore, S is empty. 

Next we show that eachn EN, n > 1, is a product of primes. Let S denote 
the set of natural numbers greater than | which cannot be written as a product 
of primes. If S is not empty, then S has a smallest element m and by the argu- 
ment above, m = pm’ for some prime p. Since m’ < m, we have m' ¢ S. As a 
result m’ can be written as a product of primes, p,p. °°: p,, or else m’ = 1. 
Therefore, either m = pp; p2 °° py, Of m= p, and we have that m is a product 
of primes, which contradicts m € S. Consequently, S is empty. 

Finally, suppose there is a natural number greater than | which can be 
written in two ways as a product of primes: 


N=)P\P2°°" Pr= 992 °°° %U- 


Then p, |q,92 °° q, and, by repeated use of the corollary of 23, we may con- 
clude that p, divides one of the q’s, say p, |qg,. Since q, is prime, it follows that 
P,; =q.- AS a result 


P2P3°°* Pr= 492493 °°° 4% 


and a similar argument shows that p, = q, (renumbering the q’s if necessary). 
Continuing in the same manner, we arrive at the conclusion that kK = / and the 
two representations of n are identical (except for the order of the factors). 


Corollary. Every natural number greater than | can be expressed uniquely 
in the form p\'p3> +++ py* where py, P2,---> Py are prime numbers and 
Vi, Vz, +00, EN. 
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24¢. Let a =p''py’ >>: py" and b = p\'p? --- pi*. Show that 
(a, b) Sap nt ilk) gy mint v2, 2) 56 Deter slti) | 


(a, 5] Sopris) py ueriv. 82) 50:0 Pyarrns He) 
24p. Compute the number of divisors of n = p}'p3? *** py. 
25. The Euler Function ¢. For any natural number n we let $() denote the 
number of integers k such that 1 <k <n and (k,n) =1. @ is called the 
totient, indicator, or Euler -function. Since the greatest common divisor 


(k, n) depends only upon the congruence class [k], (238), we may define (7) 
in another way as the number of elements in the set 


Z, = {Ik}, € Z, | (k, 1) = 1}. 


Neither of these characterizations is useful in computing values of ¢, but we 
shall use both to express ¢$(m) in terms of the unique factorization of n as a 
product of prime powers. 


l 
Proposition. If p is prime, then $(p") = r*(1 ~ ;] 


Proof. Clearly, (k, p") = 1 if and only if p 4 k. There are p"”' numbers 
between | and p” which are divisible by p, namely 


lpn 2p, 3p,¥ en(p" p: 


l 
Therefore, ¢(p") = p"— p" |= p*(1 - -) 


Proposition. If (m,n) = 1, then (mn) = o(m)d(n). 
Proof. We shall construct a one-to-one correspondence 
P: Linn > Ly X Zh. 


The proposition then follows immediately, because Z/,, has f(r) elements 
and Z,, x Z, has $(77)$() elements. The mapping p is given by 


PK mn) = (lA ]m > (kn): 


It is routine to verify that p is well defined. 
The mapping p is one-to-one. Suppose P([K]mn) = PC[K'Jmn)» Then we have 
[A] mn = [k’],, and [kK], = [k’],, or what is the same thing, 


k=k'modm and k=k' modaz. 
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Since (m, n) = 1, it follows that 
k=k' mod mn and [k]yn = [K Inn 


(see 238). 
The mapping p is onto. Since (m, n) = 1, there are integers u, v € Z such that 
um + vn = 1. Given [a],, € Z;, and [bd], € Z;,, we set k = bum + avn. Then 


k=avn=amodm and k=bum=bmodn. 


Hence, [k],, = [a],, and [k], = [b],. What is more, (kK, mn) = 1. If p is a prime 
and p|mn, then p|m or p|n. If p|m, then k =amodp and a#0modp 
because (a, m) = 1; therefore pk. Similarly, p|n implies p ’k. This shows 
that p| mn implies p +k for any prime p, which implies that (k, mn) = 1. 


Theorem. For every natural number 


somal -2)0-2)-(-¥) 


where p,, Pz, .-+, Py are the distinct primes dividing n. 


Proof. Write n inthe form p}'p;? -:-: p,* as guaranteed by the corollary in 
24. From the two propositions above it follows that 


on) = BP )602) > 8(PE) = Pr (1 = ~\px(1 r =| pp (1 ~), 


and the formula of the theorem follows immediately. 


25a. Provethat )'4), 6(d) =n. )-4), denotes the sum over all the divisors of n. 
For example, 6 has the divisors 1, 2, 3, and 6, so that 


» $(4) = (1) + P(2) + O(3) + (6). 
25B. The Mobius Function. For every natural number n we define a number 


H(n) by the rules: 


a) #1) ae 
(2) p(n) =0_ if p?|n for some prime p, 
(3) p(n) =(—1)* ifn =p,p2°** p, is a product of distinct primes. 


Show that (m, n) = | implies u(mn) = p(m) p(n) and that 


(n) = » u(d) « (n/d). 


Group Theory 


Chapter 2 


The theory of groups is the proper place to begin the study of abstract 
algebra. Not only were groups the first algebraic structures to be characterized 
axiomatically and developed systematically from an abstract point of view, 
but more important, the concept of group structure is basic to the development 
of more complex abstractions such as rings and fields. Furthermore, group 
theory has an enormous number of applications to many diverse areas of 
mathematics and physics. Hardly any other area of mathematics can match 
the theory of groups in elegance and usefulness. 

This chapter is an exposition of the elementary theory of groups with 
emphasis on groups of finite order. Three advanced topics (the Sylow 
theorems, the Jordan-H6lder theorem, and simplicity of alternating groups) 
are included for applications and depth. 


The Definition of Group Structure 


26. A group is a set G with an operation (called the group product) which 
associates to each ordered pair (a, b) of elements of G an element ab of G 
in such a way that: 


17 
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(1) for any elements a, b,c EG, (ab)c = a(bc); 

(2) there is a unique element e € G such that ea = a = ae for any element 
aeG; 

(3) for each ae G there is an element a~' € G such that a~'a =e = aa™!. 


To be precise a group product should be viewed as a mapping 
pu: G x GG, and the group should be denoted (G, 1) to emphasize the role 
of the product. This notation would distinguish groups (G, yw) and (G, v) 
which have the same underlying set but different products. However, such 
strict formalism obscures intuition and creates notational nuisances. 

A set G = {e} with the single element e and product defined by ee =e 
satisfies (1), (2), and (3) trivially and is consequently called a trivial group. 


26a. A semigroup is a set S with a product which associates to each ordered 
pair (a, b) of elements of S an element abe S in such a way that (ab)c = a(bc) 
for any elements a, b, ce S. Show that the set of all mappings from a given 
set X to itself forms a semigroup in which the product is composition of 
mappings. Show that the set of all one-to-one correspondences of X with 
itself forms a group under composition. 


26B. Let S be a semigroup with an element e such that ea = a = ae for all 
ae S. Show that e is unique. (This indicates that the word unique in (2) above 
is superfluous. It is used to insure the absolute clarity of (3).) 


26y. Let S be a semigroup with an element e such that ea=a for allaeS 
and such that for every ae S there exists a~! € S for which a~'a =e. Prove 
that S is a group. 


265. Let S be a semigroup with a finite number of elements. Suppose that 
the two cancellation laws hold in S; that is, if either ab = ac or ba = ca, then 
b =c. Show that S is a group. 


26e. Let G be a group. Define a new product on G by a* b = ba for any 
a, b eG. Show that G* (the set G with product *) is a group. G* is called the 
opposite group to G. 


26¢. Let Gand G’ be groups. Define a product operation on the set G x G’ 
by the rule (a, a’)(b, b’) = (ab, a'b’). Show that G x G’ is a group under this 
product. (G x G’ is called the direct product of G and G’.) 


26n. A symmetry of a geometric figure is a one-to-one correspondence of 
the figure with itself preserving the distance between points; in other words, a 
symmetry is a self-congruence. The set of all symmetries of a given figure 
forms a group under composition. (Why?) For example, the group of sym- 
metries of a line segment AB consists of two elements, the identity and the 
symmetry reversing A and B. Show that a symmetry of an equilateral triangle 
ABC is completely determined by the way it transforms the vertices. Make a 
complete list of the elements of the group of symmetries of ABC. 
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Figure 2 


260. The group of symmetries of a regular polygon of n sides is called the 
dihedral group D,,. How many elements does D, have? 


261. Let V be a set with the four elements e, a, b, c on which a product is 
defined by the “ multiplication table”’ in Table 3. Verify that V is a group. (V 
is known as the four-group or Viergruppe of Felix Klein.) 


Table 3 

ei) b ¢ 
@ INO -@ iE 
A\ a 2 @ y) 
DAD GC" ue) a 
Cites Db oh eg 


26x. Show that the power set 2” of any set X is a group under the operation 
of symmetric difference A * B (88). 


26. Show that the set (—1, 1) of real numbers x such that -—l1<x<l 
forms a group under the operation x: y = (x + y)/(I + xy). 


26. Find an operation on the set (0, 1) of real numbers x, 0 < x < 1, which 
makes (0, 1) a group in such a way that the inverse of x is 1 — x. 


26v. Generalize the definition of direct product given in 26€ to obtain a defi- 
nition of the direct product G, x G, x -:: x G, of n groups, G,, G2, ..., G,. 


27. Statements (1), (2), and (3) of 26 are known as the axioms of group 
structure. Group structure may be axiomatically characterized in several 
ways, but the particular way given here is the most direct and convenient. 
(1) is called the associativity axiom, because it states that the two ways 
of associating a product of three elements are equal. Consequently, the 
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notation abc is unambiguous. More generally, it follows (after some argument) 
that all the various ways of associating the product of any number of elements 
are equal. Therefore, notation such as a,a, ‘-*+ a, is unambiguous. 

(2) is called the identity axiom and e is called the identity element. 

(3) is called the inverse axiom and a7’ is called the inverse of a. (In 28 we 
shall see that a~' is unique.) 

It is customary to extend the product notation for elements in the following 
ways: 

a" denotes the product of a with itself n times; 

a~" denotes (a~')” and a° =e; 

if A and B are subsets of the group G, then 


AB = {xe G|x =ab,ae A, be B}. 


27a. Show that the five distinct ways of associating a product of four group 
elements in a given order are all equal. 


27B. Let a,a,-+-- a, be defined inductively by the rule: 
a\a,°°° a, = (a,a, Ty A, —1)ap : 


(This gives a particular association for the product of a,, a,,...,a,.) Prove 
that 


(a,a2 ie Ap) (Gn+ 14n+2 in| Gn +m) z Qa2 °° Ant m: 


277. With the result of 27B prove the general associative law, that all the 
ways of associating a product of any number of elements in a given order 
are equal. (This means that expressions such as a,a, «+: a, are unambiguous.) 


28. Proposition. For any elements a,b,c, dofa group, it is true that 
(1) ab=e implies b=a™'; 
(2) (c"')"* =e; 
(Ze, (ca) donk 
(Note that (1) implies that inverses are unique.) 


Proof. 
(1) b = eb =(a7!a)b = a7 (ab) =a 'e = a"! 
(2) Apply (1) with a=c™' and b=c. 
(3) Set a=cd and b=d7'c™!. Then we have ab =(cd)(d~!c~') = 
c(dd~)c~! = cc™! = e and by (1) it follows that d~'c~! = b =a™! =(cd)™. 


28a. Prove that for any elements a,, a,,..., a, of a group, 


(a,a, °*+ a,)~" =a,' +++ az ay’. 
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28B. Let a be an element of a group G. Show that the mapping 2,:G +G 
given by 1, g = ag for any g EG is a one-to-one correspondence. 


28y. A group G is isomorphic to a group G’ if there exists a one-to-one 
correspondence ¢: G > G’ such that (ab) = (fa)(¢d), or in other words, such 
that @ preserves group products. Show that such a mapping @¢ (called an 
isomorphism) also preserves identity elements and inverses. 


285. Let S be a set with an operation which assigns to each ordered pair 
(a, b) of elements of S an element a/b of S in such a way that: 
(1) there is an element 1 € S, such that a/b =1 if and only if a=); 
(2) for any elements a,b,ceéS, (a/c)/(b/c) = a/b. 
Show that S is a group under the product defined by ab = a/(1/b). 


29. A group G is abelian if ab = ba for all a,be€G. Abelian groups are 
named for Neils Henrik Abel (1802-1829), who discovered their importance 
in his research on the theory of equations. It is often convenient to use 
additive notation for abelian groups: the group product of two elements a 
and b is written a + 5; 0 denotes the identity element; —a denotes the inverse 
of a. Then the axioms of group structure read: 


(1) (a+6)+c=a+(b+0), 
2) Or a—a=a-+ 0, 
(3) -a+a=0=a+(-a), 
and of course the defining property of abelian group structure, 
(4) a+b=b+a. 


29a. Which of the following sets are abelian groups under the indicated 
operations ? 


(1) The set Q of rational numbers under addition; under multiplication. 

(2) The set Q* of nonzero rational numbers under addition; under multi- 
plication. 

(3) The set Q* of positive rational numbers under addition; under multi- 
plication. 

(4) The set Q?*? of 2 x 2 matrices with rational entries under addition; 
under multiplication. 

(5) The set Q?*? of 2 x 2 matrices with determinant | under addition; 
under multiplication. 


29B. Prove that a group with less than six elements fs abelian. Construct a 
group with six elements which is not abelian, and one which is. 


29y. Prove that for 1 > 2 the dihedral group D,, is non-abelian. 


295. Show that a group G in which x? = e for every x EG, is an abelian 
group. 
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Examples of Group Structure 


30. The Symmetric Group on n Letters. Let N, denote the set {1, 2,..., n}. 
A permutation of n letters is a one-to-one, onto mapping from N, to N,,. S, 
will denote the set of all permutations of 7 letters. If z, pe S,, then clearly 
the composite mapping zp € S,,. The operation which assigns to each ordered 
pair (x, p) of permutations of n letters their composite, zp, is a group 
product on S,: 


(1) composition of mappings is associative as we have observed in (16); 

(2) the identity mapping (10) 1,: N, 2N,, is the identity element; 

(3) for any xe S,, the inverse mapping (13) x~' € S, serves as an inverse 
element for z in the group-theoretic sense. 


The group S, is called the symmetric group on n letters. We shall devote 
considerable space to the theory of symmetric groups at the end of this 
chapter (76-86). 


30a. Show that the group S, has n! elements. 
308. Show that S, is not abelian for n > 2. 


30y. Construct an isomorphism of the symmetric group S, with the dihedral 
group D,. (See 268.) 


31. The Circle Group. Let K denote the set of points of unit distance from 
the origin in the plane of complex numbers C. In other words 


K=jne C\\|z=— 1h, 


The set K is closed under the usual product of complex numbers: if z, we K, 
then |zw| = |z| |w| = 1 so that zwe K. In fact K is a group under this multi- 
plication: 


(1) multiplication of complex numbers is associative; 
(2) the identity element of K is the complex number | = 1 + 07; 
(3) the inverse of ze K is just 1/z. 


K is called the circle group since the elements of K form a circle in the 
complex plane. Note that K is abelian. 


32. The Additive Group of Integers. Addition is a group product on the set 
of integers Z. The identity element is 0 and the inverse of 2 is —n. Of course 
Z is abelian. 
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33. The Additive Group of Integers Modulo n. Let Z, denote the set of 
equivalence or congruence classes modulo the natural number n (18). Recall 
that we may take 


Z, = {(0],5 Jn. ++» [4 — lab. 


A group product on Z, is given by addition of congruence classes: [a], + [b], = 
[a + b],. Of course we must verify that this addition is well defined; that 
is, we must show that [a’], = [a], and [b’],, = [6], imply [a’ + b’], = [a + 5],. 
(We leave it to the reader.) The identity element of Z, is [0], and the inverse 
of [K],, is 


—[k], a aly =a [n =a k),- 
As the additive notation suggests, Z, is abelian. 
34. The Multiplicative Group Modulo n. We can multiply elements of the 


set Z, by setting [a],[b], = [ab],. (Check that this is well defined.) Clearly, 
for any integer k, 


[ALC = (4), = (11,14, - 


Furthermore, [1], is the only element of Z, with this property since 
[x],{[k], = [k], implies that xk=kmodn; taking k=1, we obtain 
x = 1 mod 2, or what is the same thing, [x], = [I], Consequently, [1], is an 
identity element. 

However, this multiplication is not a group product for Z,, because some 
elements ([0], for example) do not have an inverse. In fact [k], € Z, has an 
inverse if and only if k and 7 are relatively prime. We show this as follows. 
Suppose (k, n) = 1. By 23 there are integers uw and v such that uk + un =1. 
Thus, 


[uk + vn], = [uk], = (walk), = U1), 
and [u], is an inverse for [k],. On the other hand, if 
[w)fA], = [vk], = (1), 


for some integer u, then uk = 1modn and uk + un =1 for some integer v. 
This implies that (k, n) = 1. It is now a routine matter to verify that 


Z,, = {[k], € Z, | (km) = 1} 


forms a group under multiplication of equivalence classes. Note that for a 
prime number p we have 


Z, = LL. dps [2] 5-5 me) NE [p Te L Jy} 


We shall call Z, the multiplicative group modulo n. 
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Subgroups and Cosets 


35. A subgroup of a group G is a nonempty subset H such that (1) 4,be H 
implies ab € H, and (2) ae H implies a“! € H. 

Clearly a subgroup H of a group G is a group in its own right under the 
group product inherited from G. The sets {e} and G are subgroups of a group 
G. A subgroup H of a group G is a proper subgroup when H is a proper 
subset of G. The subgroup {e} is trivial; all others are called nontrivial. 


Proposition. If H is a finite subset of a group G and a,be€H implies ab € H, 
then H is a subgroup of G. 


Proof. We need only show that ae H implies a~‘ € H. A simple induction 
argument shows that 


{xeG|x=a";neN}cH 


whenever ae H. Since H is finite, it must happen that a” = a™ for some 


natural numbers n > m. Since a" € G, we have a” "=a"a "=a"a "=e. 
Either n=m+1 and a=a" "=e, or n>m+1 and a =a" "=e for 
k>1. Thus, either a7! =e=a, or a '=a*"'. In either case a 'eH. 


35a. Show that a nonempty subset H of a group G is a subgroup of G if 
and only if a,b € H implies ab! € H. 


358. Let H,,H,,...,H, be subgroups of a group G. Show that 
H =()\g-, H;, is a subgroup of G. 


35y. Let H be a subgroup of G and let aeG. Let 
H* ={x eG |axa ec Bis 


Show that H®’ is a subgroup of G. (H’ is called the conjugate of H by a.) 
Let 


N(H) = {ae G| H* = H}. 


Show that N(H) is a subgroup of G and that H is a subgroup of N(H). 
(N(H) is called the normalizer of H.) 


355. Let Z, denote the set of elements of a group G which commute with 
all the elements of G; that is, 


Zo = {xEeG|xa=ax,aeG}. 


Show that Z, is a subgroup of G. (Z, is called the center of G.) 


Subgroups and Cosets 25 


35e. If S is any subset of a group G, then there is a smallest subgroup of 
G, say H, containing S. Why? In this case we say that S generates H. Show 
that S generates H if and only if every element of H can be written as a 
product s,52 °°: S,, where s;€ S or s; !€S for each i. (When S generates H, 
we write H = <S).) 


356. The dihedral group D, (268) is the group of symmetries of a regular 
hexagon, ABCDEF. Let a € D, denote rotation counterclockwise by 60°, and 
B denote reflection in the horizontal AD axis. Then Dg, consists of the twelve 


Figure 3 


elements «!B/ where i runs through 0, 1, 2, 3, 4,5 and j =0, 1. Table 4 indi- 
cates how each element of D, acts on the vertices of ABCDEF. 


Table 4 


(1) Determine the subgroup of D, leaving C fixed. 
(2) Determine the subgroup in which A, C, and E are permuted and show 
that it is isomorphic to D3. 
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(3) Determine the subgroup generated by «* and «°f. 

(4) Show how each of the elements Bak for k = 1, 2, 3, 4, 5 may be written 
in the form a‘f/. 

(5) Determine the center of D,. (358.) 


35n. Show that the dihedral group D, is generated by two elements « and 
B satisfying a" = e, B? =e, and Ba =a" |B. 


350. Let X, Y, and Z be subgroups of a group G. Show that Y c X implies 
pase GA — a6 ay 74): 


36. Proposition. Every nontrivial subgroup of Z, the additive group of 
integers, has the form nZ = {x € Z|n| x} for some natural number n. 


Proof. A nontrivial subgroup H of Z contains some integer m # 0. Since 
H also contains —m, the set H ON is not empty and therefore has a least 
element‘n. It clearly follows that ane H for every ae Z, or in other words, 
nZ<H. Suppose me H —nZ. Then m#0 and we may assume m> 0. 
(Otherwise replace m by —m.) By the division theorem (21) m = qn + r, and 
we have 0<r<n, since r=0 implies m=qnenZ. Now we have r= 
m—qneéH and 0<r<vn, which imply that 7 is not the least element of 
H ON. This contradiction forces the conclusion that H — nZ = 9 or H = nZ. 


36a. Describe the subgroup nZ 9 mZ of Z. 


36B. Describe the subgroup of Z generated by n and m, that is, by the sub- 
set {n, m}. 


36y. Describe all the subgroups of Z,,. 
365. Showthat Z x Z has subgroups not of the form nZ x mZ. 


37. Congruence modulo a subgroup. Let H be a subgroup of a group G. 
We use H to define an equivalence relation on G. Let 


Ry = {(x, y)€G x G|x7"y € H}. 


Certainly (x, x) € Ry for all x € G, because x~'x = ee H. If (x, y) € Ry, then 
x” 'yeH,and consequently (x~1y)~1 € H. However, (x~'y)7! = y~!x, and 
thus (y, x) € Ry. Finally, (x, y) and (y, z)in Ry imply x~'y € H and y~'ze H, 
which shows (x~!y)(y~4z) = x71z € H, or (x, z)€ Ry. We have verified that 
Ry is an equivalence relation (17). 

We write x = y mod H when (x, y)e€ Ry. Clearly, x = y mod H if and 
only if y = xh for some h € H. Consequently, we denote the equivalence class 
of x € G by xH to indicate that it consists of all the elements xh where / € H. 
We call xH the left coset of x modulo H. The set of all equivalence classes 
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(left cosets), denoted by G/H, is called the left coset space of G modulo H. 

Since left cosets are equivalence classes, two left cosets are either identical 
or disjoint. To rephrase, xH = yH when x =ymodH, and xHn yH =0 
when x# y mod H. Note that eH = H. 

An example of congruence modulo a subgroup is furnished by Z modulo 
mZ. This is just congruence modulo the integer m as defined previously (18), 
and the quotient set Z/mZ is just Z,,. The coset of the integer k is denoted by 
k + mZ to conform with the additive notation for abelian groups, or [K],, 
as before. 


37a. Show that all the left cosets of a group G with respect to a subgroup 
H have the same number of elements; in other words, show that any two left 
cosets are in one-to-one correspondence. 


37B. Let H bea subgroup of a group G. Define an equivalence relation on G 
which partitions G into right cosets, that is, subsets of the form 


Hy ={xeG|x=hy, he H}. 


Prove that the number of right cosets is the same as the number of left cosets 
of G with respect to H; that is, show that the set of right cosets and the set of 
left cosets are in One-to-one correspondence. 


37y. Show that when G is abelian, every right coset is a left coset modulo H. 


376. Let an equivalence relation on S, be defined by x~t if and only if 


mn = tn. Show that this equivalence relation is congruence modulo a subgroup 
of S,. 


38. The order o(G) of a group G with a finite number of elements is just the 
number of elements of G. Thus, o(Z,,) =n, 0(Z,) = (7) (see 25, 34). The order 
of the symmetric group S, is the number of permutations of 7 letters (30). 
For any such permutation z: N, ~N, there arenchoicesforz(1),n — 1 choices 
for z(2), and so forth; in all there are n(m — 1) ---(1) =a! permutations of n 
letters. In other words o(S,) = n!. 

If a group has an infinite number of elements, it has infinite order. For 
example, the circle group K (31) and the additive group of integers Z (32) 
are groups of infinite order. 

For brevity, a group of finite [infinite] order is called a finite [infinite] 
gr oup. 


38a. Let H and K be subgroups of a group G. Show that HK is a subgroup 
of G if and only if HK = KH. Show that when HK is a finite subgroup of G, 


o(H K) =0(H)o(K)/o(H 7 K). 


388. Determine all the subgroups of the dihedral group D, and the order 
of each. (See 35C.) 
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38y. Let G be a nontrivial group with no proper subgroups except the 
trivial one. Show that G is finite and that the order of G is prime. 


385. Prove that a group G of even order contains an element a  e such that 
2 
a* =e. 


39. The index [G: H] of a subgroup H of a group G is the number of left 
cosets of G modulo H, or in other words, the number of elements of the left 
coset space G/H, provided this number is finite. Otherwise, the index is said to 
be infinite. 

For example, the left cosets of Z modulo mZ are the sets mZ + k where k 
runs through 0, 1, ..., m— 1. Thus, [Z: mZ] = m. 


Proposition. If K is a subgroup of H and H a subgroup of G, then [G: K] = 
[G: H)[H: K], provided these indices are finite. 


Proof. It is easy to see that each coset gH of G modulo H contains the 
cosets g(hK) = (gh)K of G modulo K, where AK runs through all the cosets 
of H modulo K. 


39a. Let H denote the subgroup of S, consisting of all elements z € S, such 
that mn = n. What is [S,: H]? 


39p. If H and K are subgroups of finite index of a group G, show that 
H 7K is a subgroup of finite index of G and that 


[G: HK) <[G: H)[G: K]. 


40. Lagrange’s Theorem. If H is a subgroup of G, a group of finite order, 
then [G: H] = o(G)/o(A). 


Proof. The map f,:H > yH given by f,(h) = yh is a one-to-one corre- 
spondence. The inverse map f, ': yH > H is given by f, '(x) = y~‘x. Thus, 
the left coset yH has the same number of elements as H, namely o(H). In 
fact each left coset of G has o(H) elements. Since left cosets are identical or 
disjoint, each element of G belongs to precisely one left coset. There are 
[G: H] left cosets, and therefore, o(G) = [G: H]o(H). All three numbers are 
finite. Thus, [G: H] = o(G)/o(H). 


Lagrange’s theorem is frequently stated as the order of a subgroup of a 
group of finite order divides the order of the group. Of course the equation 
[G: H] = o(G)/o(H) makes sense only when o(G) is finite (and consequently 
o(H) and [G: H) are also finite). However, even when o(G) is infinite, a sensible 
interpretation can be made: if [G: H] is finite, then o(H) is infinite; if o(H) is 
finite, then [G: H] is infinite. In other words, a subgroup of finite index in an 
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infinite group is of infinite order; a finite subgroup of an infinite group has 
infinite index. 

Joseph Louis Lagrange (1736-1813) made contributions to number theory, 
the theory of equations, and the calculus of variations as well as other fields. 
His observations on the theory of equations set the stage for the development 
of group theory. 


41. If ais an element of a group G, then the order of a, o(a), is the order of 
the subgroup 


<a>) ={xEG|x=a";neEZ}, 


which is the smallest subgroup of G containing the element a. If <a) is finite, 
it happens that a" = a” for some integers n and m such that n > m. Then, 
a"”-"™=e, and n—m>O. Let k be the smallest positive integer such that 
a‘ = e. Writing any given integer n uniquely in the form n = qk + r, where 
0<r<k, we have 


fea = (Gig = a al 


=a 


Thus, each a” equals one of the elements a° = e, a! =a, a’,...,a*~'. In 


other words, 
Kay ="{e,.dya,). fae); 


and the order of a is o(<a)) =k. Therefore, we can state: the order of an 
element a of a group G is the smallest positive integer k such that a‘ = e (if such 
a k exists). If <a) is infinite, then for all positive k, a* 4 e, and a is called an 
element of infinite order. 

If a is an element of a finite group G, then LaGrange’s theorem (40) 
implies that o(a) | o(G), and it follows that a® = e. In other words, the order 
of an element of a finite group divides the order of the group. We use this 
principle in the next article to prove Euler’s theorem. 


41a. Determine the order of a” wherea is an element of order ina group G. 


41B. Let aandb be elements of an abelian group G. Describe o(ab) in terms 
of o(a) and o(d). 


41y. Show that o(axa~') = o(x) for any elements a and x of a group G. 
416. For any elements a and b of a group G, show that o(ab) = o(ba). 


41s. Let G be an abelian group. Show that for any natural number n, the 
set G, = {x € G| o(x) |n} is a subgroup of G. 


41¢. Prove there can be only two distinct groups with order 4 (up to iso- 
morphism). Do the same for order 6. 
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41n. Prove that a group with only a finite number of subgroups must be 
finite. 


42. Euler’s Theorem. If a is any integer prime to m, then a®” = 1 mod m. 


Proof. Since (a,m) = 1, the congruence class modulo m of a belongs to 
Z,,, the multiplicative group modulo m (34). By (41) the order k of [a] € Z,, 
divides the order of Z/,, which is just #(m) where ¢ is the Euler function (25). 
Then, [a]* = [1] implies that [a]*” = [a®”] = [1], or what is the same 
thing, a®” = 1 mod m. 


Corollary. If p is a prime number, then a?~' =1 mod p for any integer a 
not divisible by p. 


The corollary is known as Fermat’s little theorem. 


43. A group G is cyclic if every element of G is a power a“ (k positive, 
negative, or zero) of a fixed element aeé G. The element a is said to generate 
the group G. A cyclic group is always abelian. 

A cyclic group may be finite or infinite. If G is an infinite cyclic group 
generated by aeéG, then a is an element of infinite order in G and all the 
powers of a are distinct. Thus, G={..., a’, a~', e, a’, a”, ...}. The additive 
group of integers Z is an infinite cyclic group generated by the element | € Z. 
(Powers must be interpreted additively: ne Z is the n-th “‘ power’’ of 1.) 

If G is a cyclic group of finite order n generated by a, then G= 
{e, a, a’, ...,a"~'} and the order of ais also n. The additive group of integers 
modulo m, Z,,, is a cyclic group of order m generated by [1],, € Z,,- 


Proposition. A subgroup of a cyclic group is again cyclic. 


Proof. Let G be a cyclic group, generated by ae G, with a subgroup H. 
We may assume H # {e}: a trivial group is always cyclic. Let k be the smallest 
element of the set S = {n e N|a" eH}. Then if a" e H, we have k |n; other- 
wise n = gk + r withO <r <k, anda’ = a"(a")~4 € H contradicting the choice 
of k. Thus, a"€H implies a" = (a‘)? where q =n/k. Consequently H is a 
cyclic group generated by a*. 


Proposition. A group of prime order is cyclic. 


Proof. If G has order p, where p is a prime number, then the order of any 
element a is | or p—the only divisors of p. Since the only element of order 1 
is the identity e, any element a # e has order p and generates G. 
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43a. What is the number of elements of order din a cyclic group of order n? 
(See 25a.) 


43B. Let G be a finite group with the property that for any two subgroups 
H and K, either H c K or K CH. Show that G is a cyclic group whose order 
is a power of a prime. 


43y. Let G and G’ be cyclic groups the orders of which are relatively prime. 
Show that their direct product G x G’ is cyclic. 


435. Show that the direct product of two infinite cyclic groups is not cyclic. 


44. The n-th roots of unity. As another example of a finite cyclic group we 
take up the group of n-th roots of unity. This group will play an important 
role later in the study of Galois theory. 

A complex number ze C is an n-th root of unity if z" = 1. It follows from 
z" = 1, that |z|"=1, |z| =1, and z =e” for some @€ (0, 27]. Thus, every 
n-th root of unity is an element of the circle group K (31). Substituting 
z=e' in z" = 1, we have e'"® = 1, so that nO must be multiple of 2z, or in 
other words, 0 = 2xk/n for some integer k. The value of e2"'*/" depends 
only upon the congruence class of k modulo n. Therefore, there are precisely 1 
n-th roots of unity, namely the complex numbers e2*'*/" for k = 0, 1, 2,..., 
n—1. Letting C denote e?"/", we see that e2"*/" = ¢* and 


NG = {ly ie a ES. CERES, a) 


is a complete set of n-th roots of unity. It is easily verified that K, is a cyclic 
group of order n generated by ¢. Note that 


cow = (CC = Cre = Ce 


Now ¢ is not the only element of K, which generates K,,. (In general the 
generator of a cyclic group is not unique.) An element of K, which generates 
K,, is called a primitive n-th root of unity. The following proposition identifies 
the primitive n-th roots of unity. 


Proposition. (* eK, ={1,6,C7,...,0"~'} is a primitive n-th root of unity 
if and only if k is prime to n. 


Proof. If (k,n) = 1, then 1 = uk + un for some u, v € Z and 
i" me rane Ves = fost = a 


for all m. Thus C* generates K,. On the other hand, if C* generates K,, we 
have 6 =(¢*)" so that uk = 1 modn or | = uk + vn for some u, ve Z and 
(km) =1. 
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44a. Let C be a cyclic group with generator c. What other elements of C 
generate C? (Include the case in which C is infinite.) 


44B. Let { be a primitive n-th root of unity, and let € be a primitive m-th 
root of unity. What is the subgroup of the circle group K generated by ¢ 
and €? 


44y. Describe the group K, 9 K,,. 
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45. Let S be any subset of a group G, and let a be any element of G. The set 
S? = {xeG|axa~* eS} 
is called the conjugate of S by a. We note that (S*)® = S® and that S* = S. 


Proposition. If S is a subgroup of G, then S* is also a subgroup of G. 


1 


Proof. Suppose x, y € S*. Then axa™', aya~' € S, and consequently, since 


S is a subgroup, we have 
(axa~')(aya~!) = axya™' €S. 


Therefore, xy € S*. Similarly, x € S* implies axa~' € S, from which it follows 
that 


(axa) ~" S axa MES Mand xX Tes] 


46. Normal Subgroups. A subgroup H ofa group G is normal if H is equal 
to each of its conjugates, that is, if H*° = H for every element a € G. Normal 
subgroups are also called invariant or self-conjugate subgroups. We write 
H=<G to indicate that H is anormal subgroup of G. 

It is easy to see and worthwhile to note that every subgroup of an abelian 
group is normal: if H is a subgroup of an abelian group G, then x e H? if 
and only if axa~' = aa~1x = x EH; thus, H* = H forall aeG. 


46a. Show that a subgroup H of a group G is normal if and only if every 
left coset of H is equal to some right coset of H. 
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46B. Show that a subgroup H of a group G is normal if and only if abe H 
implies a~'b~! € H for any elements a, be G. 


46y. Show that a subgroup of index 2 is always normal. 


465. Let H and N be subgroups of a group G and let N be normal. Show 
that HO N is a normal subgroup of H. (This will be used in 69.) 


46e. Let H and N be subgroups of a group G, N normal. Show that HN 
is a subgroup of G and that N is a normal subgroup of HN. (This will be 
used in 69.) 


46€. Let N and N’ be normal subgroups of a group G. Show that NN’ is a 
normal subgroup of G. 


46n. Let H, K, and N be subgroups of a group G, Ka H and Na G. Prove 
that NKa NH. 


460. Show that the center Z, of a group G is always a normal subgroup. 
(See 356.) 


461. The quaternion group Q is a group of order 8 which may be presented 
as a group with two generators aand b subject to the relations a* = e, b? = a?, 
and aba = b. To be explicit, the elements of Q are 


e, a, a’, a, b, ab, a*b, a*b 


and the group product is given by Table S. 


Table 5 
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(The entry in the x-row, y-column is the product xy.) 
Prove that Q is Hamiltonian, that is, that Q is anon-abelian group of which 
every subgroup is normal. 


46x. Show that a non-abelian group of order 8 with a single element of 
order 2 is isomorphic to the quaternion group Q (461). 


46). Show that the quaternion group Q is not isomorphic to the dihedral 
group D, (266). 


46u. Let H and K be normal subgroups of G such that H m K = {e}. Show 
that the group HK is isomorphic to H x K, the direct product of H and 
K (266). 

46v. Let H, and H, be subgroups of a group G and N,< ,H,, N,< Hp. 
Show that 


N,(H, 0 N2)3 N,(H, 29 A) 
and 
(H, 0 N,)(H, 0 N,)3 (A, 2 HA). 


(This will be used in 70.) 


47. Theorem. If H is anormal subgroup of G, then the left coset space G/H 
is a group with the product (aH)(bH) = (ab)H. 


Proof. Since the left cosets aH and bH are subsets of G, their product 
(aH)(bA) is the set 


{xe G|x=a'b'; da eaH, b €bH}. 


This product depends only upon the cosets aH and 65H and not upon the 
elements a and b. Therefore, it is well defined. Clearly, (ab)H < (aH)(bH): 
given x € (ab)H, x = abh for some he H, and x = a'b’ where a’ = ae aH and 
b’ = bhe bH. On the other hand, every element of (aH)(bH) has the form 
ahbh' where h, h' € H. Since H is normal and H® = H, we have h = bh"b7! 
for some h” € H. Therefore, 


ahbh! = a(bh"b~")bh’ = (ab)(h'h’) € (ab)H. 


Consequently, (a@H)(bH) = (ab)H. In other words, we have shown that 
when a subgroup is normal, the product of two of its left cosets is again a left 
coset. This product is clearly associative, the left coset eH = H serves as the 
identity, and (@H)~' = a~'H. It is now apparent that G/H is a group. G/H is 
called the quotient group of G by H. 
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Corollary. If H< G and G is finite, then o(G/H) = o(G)/o(H). 
This follows from 40 since o(G/H) = [G: H]. 


47a. Describe the quotient group Z/mZ. 


47B. Zis anormal subgroup of R (real numbers under addition). Show that 
the quotient group R/Z is isomorphic to the circle group K (31). 


47y. Prove that every quotient group of a cyclic group is cyclic. 


476. Let H= G. Show that the order of aH as an element of the quotient 
group G/H divides the order of aeG. 


47. Prove that if G/Z, is a cyclic group, then G is abelian. (Z¢ denotes the 
center of G. See 355 and 468.) 


476. Let Ha G. Show that G/H is abelian if and only if H contains every 
element of the form aba~1b~+, where a, bE G. 


47m. Let Hx G. Let K be a subgroup of G/H and set 
K ={geG|gHe K}. 


Show that K is a subgroup of G containing H and that Ks G if and only if 
K<= G/H. Conclude that there is a one-to-one correspondence between 
subgroups (normal subgroups) of G/H and subgroups (normal subgroups) of 
G which contain H. 


470. Let G x G’ denote the direct product of the groups G and G’ (266). 
Show that HaG and H's G’ imply (H x H’)< (G x G’) and that the 


quotient group (G x G’)/(H x H’) is isomorphic to the direct product 
(G/H) x (G'/H’). 


48. The normalizer of a subset S of a group G is the set 
N(S) = {ae G|S° = S}, 


where S° denotes the conjugate of S by a. (See 45.) 


Proposition. The normalizer N(S) of S in G is a subgroup of G. 


Proof. 
(1) If a, be N(S), then 


5% i: cay’ = st — S, 
and therefore ab € N(S). 
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(2) If ae N(S), then 
ga = (s7y’ = gaa! Ss S 
and therefore a~/ € N(S). 


Proposition. If H is a subgroup of G, then N(H) is the largest subgroup of G 
containing H as a normal subgroup. 


Proof. Clearly H < N(H), and therefore H is a subgroup of N(H). To 
see that H is a normal subgroup of N(H), we note that for ae N(H), the 
conjugate of H by a, in the group N(R), is 


{x € N(H) |axa~1 € H} = H* 0 N(H) =H A N(A) =H. 


Now suppose that H is normal in a subgroup N of G. In N, the conjugate 
of H by a€N is NO H* =H, and therefore H ¢ H’. Similarly, H c H*"’, 
which implies that H* < (H*)*"' = H. Thus, for every ae N we have H? = H 
and ae N(H), or in other words, we have shown that N c N(H). This is what 
is meant by saying that N(H) is the largest subgroup of G in which H is 
normal. 


48a. Using the description of the dihedral group Dg, given in 35€, compute 
the normalizer of the sets {a}, {8}, and («> = {e, a, «7, a, a*, a>}. 


48B. Determine the normalizer in the symmetric group S, of the subgroup 
H of all permutations leaving n fixed. Determine all the conjugates of H 
ingS,.. 


48y. Show that N(S*) = N(S)* for any subset S of a group G. 


49. Theorem. The number of distinct subsets of a group G which are con- 
jugates of a given subset S is [G: N(S)], the index in G of the normalizer 
of S. 


Proof. S* =S? if and only if S®”' =S. In other words, S* = S? if and 
only if ab~',€ N(S), which is equivalent to a~' = b~* mod N(S). This sets up 
a one-to-one correspondence from the class G(S) of subsets of G conjugate 
to S, to the left coset space G/N(S) given by S* > a~1N(S). Since G/N(S) has 
[G: N(S)] elements, the theorem follows. 


49a. Let S bea subset of a group G, which has exactly twoconjugates. Show 
that G has a proper nontrivial normal subgroup. 


49B. Let H be a proper subgroup of a finite group G. Show there is at least 
one element of G not contained in H or in any of its conjugates. 
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50. The center of a group G is the set 


Zg = {x €G|xa = ax, for all ae G}. 
Proposition. The center of a group is a normal subgroup. 


Proof. If x, yeZg, then for any ae G we have xya = xay = axy, so that 
xyeZ,. If xeZ,, then for any ae G, xa~' = a~'x, which implies that 


me a=(@ x) Sa Sa, 


so that x_'eZ,. Thus, Zg is a subgroup of G. For any a€G we have 
mee i and only if ata “= xaa ‘'=xeEZ,. Therefote, ZZ = Z, for all 
aeéG, and Z, is a normal subgroup of G. 


Proposition. For any group G, Zg = {xeG|N(x) =G}. 


Proof. N(x) denotes the normalizer of the set {x}. If N(x) =G, then 
{x}* = {x} for each ae G, or what is the same thing, axa~' = x and ax = xa 
for all ae G, which means xe Z,. Reversing all these implications shows 
that N(x) = G implies x EZ. 


50a. Show that the center of the symmetric group S, is trivial for n > 2. 
508. Compute the center of the dihedral group D,. (See 268 and 35n.) 
507. Compute the center of the quaternion group Q (461). 


51. The Conjugacy Class Equation of a Group. Anelementxof agroupG is 
conjugate to an element y of G if x = aya ' for some element a of G—or 
what is the same thing, if the set {y} is a conjugate of the set {x} as defined in 
45. It is not difficult to verify that conjugacy is an equivalence relation and 
divides G into disjoint conjugacy classes. The number of elements in C,, the 
conjugacy class of x, is [G: N(x)], the index in G of N(x), the normalizer of 
{x}. (This follows from 49, since the number of elements in C, is the number 
of conjugates of the set {x}.) Clearly, x € Z,, the center of G, if and only if 
Pa) =G, (GCG: N(x)] = 1, and C,, = {x}. 

Suppose now that G is a finite group, and that consequently there are a 
finite number of conjugacy classes. By the above remarks it is clear that either 
xE€Z, or C, © G-—Z,. Let x,,x2,...,X,€G be elements obtained by 
choosing one element from each conjugacy class contained in G — Z,,. For 
any x € G, either x € Z, or x is conjugate to one x;. Counting up the elements 
of G, we conclude that 
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o(G) = 0(Z,) +y [G: N(x,)]. 


This is called the conjugacy class equation of G. 


5la. Divide the elements of the quaternion group Q into conjugacy classes 
and verify the conjugacy class equation. 


51f. Prove that a group of order p? is abelian (p a prime). 


52. Prime Power Groups. We illustrate the results of the last few articles 
with a brief study of groups whose order is a power of a prime number. Let 
G be a group of order p”, where p is a prime number and n > 1. 


Lemma 1. G has a nontrivial center. 


Proof. In the conjugacy class equation of G 
o(G) = o(Z¢) + Y [G: N(x], 
ia 


we must have [G: N(x;)] > 1 so that p|[G: N(x,)] for each i= 1, 2,..., m. 
Since p|o(G), it follows that p | o(Z,) and therefore Z, # {e}. 


Lemma 2. Ghas a proper nontrivial normal subgroup. 


Proof. Z,, the center of G, is a normal subgroup of G and is nontrivial 
by lemma 1. If Zg is proper, we are finished. If Zg = G, then G is abelian and 
any element ae G, a #e, generates a nontrivial normal subgroup <a). If 
o(a) = o(<a)) < o(G), then we are finished. On the other hand, o(a) = o(G) = p” 
implies o(a’) = p" *, and a? generates a proper nontrivial normal subgroup. 


Theorem. If G is a group of order p", where p is a prime number and n> 1, 
then G has a sequence of subgroups {e} = Gg © G, © *:: CG, =G such that 
0(G,) = p*, G, is normal in G,,, and the quotient group G,+,/G, is cyclic 
of order p. 


Proof. The proof is by induction on the integer n. For n = | it is trivial 
because {e} = Gp < G, =G, and a group of order p, p prime, is cyclic (43). 
Suppose the theorem is true for all groups of order p* for k <n, and let G 
be a group of order p”. By lemma 2, G has a proper nontrivial normal sub- 
group. Among all such subgroups, finite in number, let H be one of maximal 
order, say o(H) = p', t <n. We want to showt =n-—1. 


The Sylow Theorems 39 


Suppose t <n —1; then G/H is a group of order p""'> p?. By lemma 2, 
G/H has a proper, nontrivial, normal subgroup N. Let H’ denote the subset 
{g€G|gH €N} of G. H’ is a normal subgroup of G: 


(1) If 9,9'€H’, then gH, g'HEN, and since N is a subgroup of G/H, 
(gH)(g'H) = g9'H EN and gg’ € H’. 
(2) If ge H, then gH EN, and since N is a subgroup, (gH)"'=g"'HeEN 
so thatg-'eH’. 
(3) (H’) = {ge G|aga~' € H'} = {ge G|aga'HeEN} 
= {ge G|(aH)(gH)(aH)~* € N} 
={geG|gHe N*"} 
={geG|gHeN} 
= 7. 


Clearly, H’/H = N and therefore 
o(H")/0(H) = [H’: H] = o(N) 2 p, 


or in other words, o(H’) > po(H) contradicting maximality of H. Conse- 
quently, the assumption that t <n — 1 is incorrect, and t =n — 1. 

To finish the proof we set G,_, = H and apply the inductive assumption 
to the group G,_, of order p"~' to obtain the groups Gy c G, © °:' CG, _}. 
Since G/H = G,/G,,_, is a group of order p and p is prime, it is automatically 
cyclic (43). 


The sequence 
{e} = Gp CG, o°'cG,-,CG,=G 


is called a composition series for G. (See 73.) 


The Sylow Theorems 


Some information about the structure of a finite group can be obtained from 
its order alone. The most important results in this direction are the three 
theorems of Sylow which are proved in the next few articles. First we give a 
basic theorem on transformation groups which will simplify the proofs. 
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53. Transformation Groups. A group G acts on a set X (as a group of 
transformations) if to each pair (g, x) € G x X there is associated an element 
g * x € X in such a way that 


(1) g * (h* x) =(gh) * x for all g,h EG and allxe X; 
(2) ex x =x forall xe X. (e is the identity element of G.) 


We note that each geéG determines a one-to-one correspondence 
g: X > X, given by g(x) = g * x, whose inverse is g-': X > X. (These one- 
to-one correspondences are sometimes called transformations of X.) 

As examples of transformation groups we note that every group G acts 
on itself by the rule g * h = gh for all g, h EG, and more generally, if H is a 
subgroup of G, then G acts on the left coset space X = G/H by the rule 
9 *(g'H) = (gg')H. 


53a. Explain how the symmetric group S, acts as a group of transformations 
on any set X with n elements. 


53B. Let G be a group of transformations of a set X. Define Go by 
Go={gEG|g*x =x, all xe X}. 
Show that Go is anormal subgroup of G. 


53y. A group of transformations G ofa set X acts effectivelyon X ifg* x =x 
for all x € X implies g = e. In other words, G is an effective transformation 
group of X whenever Go = {e}. (See 538.) Show that if G is any transforma- 
tion group of X, effective or not, the quotient group G/G, acts effectively 
on X. 


535. Show that a group G acts effectively on the left coset space G/H if 
and only if (\,¢¢ H® = {e}. 


53e. Show thatthe set (X) of one-to-one correspondences of a set X 
forms the largest effective transformation group of X. 


54. Orbits and Stabilizers. Let G be a group acting on the set X. We define 
an equivalence relation ~ on X by setting x ~ y if and only if y =g * x for 
some géG. An equivalence class under ~ is called an orbit. The orbit of 
x eX is simply the set 


G*ex={yeX|y=g*x, for some ge G}. 


The quotient set X/~ is called the set of orbits of X under the action of G. For 
each x € X, the set 


G,={geG|g*x=x} 


is a subgroup of G called the stabilizer of x. 
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Theorem. IfG is a group acting ona finite set X, then the number of elements 
in the orbit of x € X is the index in G of the stabilizer G,. 


Proof. The mapping ¢:G—G*x given by $(g) =g * x is clearly onto. 
Furthermore, (g) = (A) if and only if g~'h € G,, or what is the same thing, 
g =h mod G,. This implies there is a well-defined, one-to-one correspondence 
G/G,>7G*x given by gG,t+g*x. It follows that G*x has the same 
number of elements as G/G, , namely [G: G,]. 


One illustration of this theorem is as follows. Let 2° denote the power set 
of a finite group G, that is, the set whose elements are the subsets of G. 
(It is not hard to see that 2° has 2% elements.) Let G act on 2° by the rule 


g*S=S9' ={xeGlg 'xgeS}. 


Then the orbit of a set Sc G is just @(S), the collection of conjugates of S, 
and the stabilizer of S is just its normalizer N(S). Thus, the theorem above 
implies 49 as a special case—the number of conjugates of S is [G: N(S)]. 


54a. Let H be a subgroup of a group G. Then H acts on G by the rule 
h*x=hx forhe H, x eG. What is the orbit of xe G under this action? 
What is the stabilizer H,.? 


54B. Let G be a group acting on a set X. Show that two elements of X 
which belong to the same orbit have conjugate subgroups of G as stabilizers. 


54y. Let H and K be subgroups of a group G. K acts on the left coset space 
G/H by k * (gH) = (kg)H. What is the stabilizer of gH? What is the number 
of elements in the orbit of gH? 


55. Cauchy’s Theorem. If p is a prime dividing the order of a finite group 
G, then G has an element of order p. 


Proof. Let n be the order of G. Let 
X = {(a, a2,...,a,) € G? | aa, +++ a, =e}. 
(G? denotes the cartesian product G x G x --- x G with p factors.) Since the 
first p — | coordinates of an element of X may be chosen arbitrarily from G, 
thereby determining the last one, it is clear that xX has n?~! elements. 
Let C be a cyclic group of order p generated by the element c. Let C act 
on the set X by the rule 


C# (a), 42, ...,G,) = (42, 43,..., 4), a). 
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By 54 the number of elements in any orbit divides the order of C, which is p. 
Thus, an orbit has either p elements or one element. Let r be the number of 
orbits with one element, and let s be the number of orbits with p elements. 
Then r + sp = n?~', the number of elements of X. 

By hypothesis p |, and consequently, r + sp = n?~' implies that p|r. We 


know that r 4 0 because the orbit of (e, e,..., e) € X has only one element. 
Therefore, there are at least p orbits with a single element. Each such element 
has the form (a, a,...,a)¢X so that a? =e. Thus, G contains at least p 


elements solving the equation x? =e. Clearly then, G contains an element 
a #e such that a? = e, and a must have order p. 


Remarks. This beautiful proof of Cauchy’s theorem is due to James H. 
McKay (‘Another proof of Cauchy’s group theorem,” American Mathematical 
Monthly, vol. 66 (1959), p. 119). 

Augustin Louis, Baron Cauchy (1789-1857) was a French mathematician 
whose prodigious contributions are important to all branches of mathematics. 
Among his contributions to algebra, in addition to the theorem above, is a 
theorem concerning the number of distinct values assumed by a function of 
several variables when the variables are permuted. This theorem lies at the 
crux of Abel’s argument that algebraic equations of the fifth degree are not 
generally solvable by radicals. (See 85a and 149.) Cauchy is perhaps the one 
mathematician entitled to be called the founder of group theory. 


56. First Sylow Theorem. If p is prime and p" divides the order of a finite 
group G, then G has a subgroup of order p". 


Proof. Suppose o(G) = p"m. The number of subsets of G which have 
precisely p" elements is the binomial coefficient 


N= (a ~ (Pin)pin = |) "ns ee 
a (p"\(p" — 1) ++ (p"— + () 


(This is the number of ways that a set of p" elements can be chosen from a 
set of p"m elements.) For 0 <i< p" the highest power of p dividing p"m — i 
is the same as that dividing p" — i. (Why?) Thus, all the factors of p in the 
numerator and the denominator of N cancel out, except those of m. Conse- 
quently, mand N have the same number of factors of p. Let p" be the highest 
power of p dividing mand N. 

G acts on the set ¥ of all subsets of G with p" elements by the rule 


g*S= 9S = eG |x— gs: sen, 


for any Se ¥. If every orbit under this action were divisible by p’*', we 
would have p’*'|N. (Why?) Therefore, there is at least one orbit, say 
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{S,,S2,-.., S,}, for which p’*' Yk. Let H denote the stabilizer of S,. We 
know that k = [G: H] = 0(G)/o(H) by 54. Since p’|m, we have p"*"|p"m. 
However, p"m = 0o(G) = ko(H), and therefore, we have p"*"|ko(H). Since 
p’*' ¥k, we must conclude that p”|o(H) and that p" < o(A). 

On the other hand, because H stabilizes S, we have Hg S, for any 
g €S,. Thus, 


o(H) = * (Hg) < &(S,) = p”. 
Combined with p” < o(H), this implies o(H) = p” and the proof is complete. 


Remarks. This elegant argument, which is a great improvement over the 
older method of double cosets, comes from a paper of Helmut Wielandt, 
“Ein Beweis fiir die Existenz der Sylowgruppen,” Archiv der Matematik, 
vol. 10 (1959) pp. 401-402. The original theorem of the Norwegian mathe- 
matician Ludwig Sylow (1832-1918) stated only the existence of a subgroup 
of order p” where p” is the highest power of p dividing the order of the group. 
Such a subgroup is called a p-Sylow subgroup. Of course the more general 
statement above is easily deduced from Sylow’s theorem and the structure of 
prime power groups (52). Finally, we observe that Cauchy’s theorem. (55) 
is a special case of the first Sylow theorem, since the existence of a subgroup 
of order p implies the existence of an element of order p. 


57. Second Sylow Theorem. All the p-Sylow subgroups of a finite group 
are conjugate. 


Proof. Let G be a finite group of order p"m, where p/m and n> 0. Let 
H be a p-Sylow subgroup of G. Of course o(H) = p" and [G: H] =m. Let 
S,, S,,..., S,, denote the left cosets of G mod H. G acts on G/H by the rule 
g * S;=g9S;. Let H; denote the stabilizer of S;. 

All the groups H; are conjugates of H. To see this we note that by 54, 
o(H ;) = p", while gHg~! ¢ H; if S; = gH. Since o(gHg™') = o(H) = o(H;), we 
have gHg"' = H;,. 

Let H’ be a second p-Sylow subgroup of G. Then H’ also acts on G/H 
by the same rule as G. Since p ) m, there is at least one orbit (under H’) with 
a number of elements not divisible by p. We may suppose that S,, S,,...,S, 
are the elements of an orbit where pr. Let K = H’ 0 H,. Then K is the 
stabilizer of S, under the action of H’. Therefore, [H’: K] =r. However, 
o(H'’) = p" and pr, from which it follows that r= | and K = H’. Therefore, 
o(K) = o(H’) = o(H,) = p", and H’ = K = H,. Thus, HW’ and H are conjugate. 


58. Third Sylow Theorem. The number of p-Sylow subgroups of a finite 
group is a divisor of their common index and is congruent to | modulo p. 
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Proof. Let G be a group of order p"m, where n > 0 and p } m. Suppose r 
is the number of p-Sylow subgroups of G. Then we want to show that r|m 
and that r= 1 mod p. 

As before, let H be any one of the p-Sylow subgroups of G. Of course 
o(H) = p" and [G: H] = m. We shall denote the elements of the left coset 
space G/H by S,, S,,...,5,,. G acts on G/H by the rule g*#S =@S for 
SeéG/H. There is only one orbit under this action, namely the whole of 
G/H. (Why ?) Therefore the stabilizer of each S; is a subgroup in G of index 
m and order p”. In other words, each coset S; has a p-Sylow subgroup as 
stabilizer. 

On the other hand, as we shall soon see, each p-Sylow subgroup is the 
stabilizer of one or more of the cosets S,,S,,...,S,,- Clearly, H is the 
stabilizer of the coset H, which must occur among the S,’s. Let S,, S,,..., Sy, 
be the elements of G/H whose stabilizer is H. By the second Sylow theorem 
any other p-Sylow subgroup of G is a conjugate gHg™! of H. It is easy to see 
that gHg™! stabilizes the cosets gS,,gS,,...,gS,- Consequently, we see 
that each one of the r distinct p-Sylow subgroups of G is the stabilizer of 
exactly k elements of G/H. Hence, we conclude that m = kr and that r|m. 

Now we restrict our attention to just the action of H on G/H. Unless 
H =G and r =1, there is more than one orbit. (Why?) Applying the orbits 
and stabilizers theorem of 54 to this restricted action and using the fact that 
o(H) = p", we can distinguish two cases: 

(1) the orbit of S; contains p' elements for some t, 0 <1 <n; 

(2) the orbit of S; contains only the element S;. 

Clearly, the second case occurs if and only if S; is one of the cosets S,, S,,..., 
S, whose stabilizer is H. Thus, counting the elements of G/H, we conclude 
that m =k + up or that m =k mod p. 

The previous conclusion that m=kr along with m=kmodp yields 
kr =k mod p, from which it follows that r=1 mod p, since k#0 mod p. 
The proof is accomplished. 


59. As asimple example of the direct application of the Sylow theorems, we 
note that a group of order 100 has a normal subgroup of order 25. The first 
theorem guarantees a 5-Sylow subgroup of order 25. The number of such 
subgroups is congruent to | modulo 5 and divides 4 by the third theorem. 
Thus, there is only one such subgroup of order 25. Since this subgroup equals 
each of its conjugates, it must be normal. 


59a. Let G bea group of order pq, where p and q are prime and p < q. Show 
that G is cyclic when g # 1 mod p. What can be said of Gwheng = 1 mod p? 


59B. Show that a group of order 2p, where p is prime, is either cyclic or 
isomorphic to the dihedral group D,. 
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59y. Suppose that G is a finite group with a normal subgroup H of order 
p*, where p is prime and k > 0. Show that H is contained in every p-Sylow 
subgroup of G. 


595. Let H be a normal subgroup of finite index in a group G. Show that if 
p*|[G: H], where p is prime, then G contains a subgroup K such that 
[K: H] = p*. 

59e. A group is simple if it has only itself and the trivial group as normal 
subgroups. (For example, groups of prime order are obviouslysimple.) Prove 
that a group of order 30 cannot be simple. 


59¢. Show that a group of order p79, where p and q are primes, is not 
simple (59¢). 


59n. Show that a simple group of order less than 60 is of prime order. (We 
shall see later that there does exist a simple group of order 60, the alternating 


group A, .) 


590. Let G bea finite group with just one p-Sylow subgroup for each prime 
p dividing o(G). Show that G is isomorphic to the direct product of all its 
Sylow subgroups. (See 26 and 26v.) 


59. Let p bea prime such that p* divides the order of the finite group G. 
Prove that the number of subgroups of order p* in Gis congruent to | modulo p. 
(This is a theorem of Georg Frobenius (1849-1917), and it is only fair to 
warn that the solution is somewhat lengthy.) 


sroup Homomorphism 
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60. A homomorphism of groups is a mapping from the set of elements of 
one group to the set of elements of another which preserves multiplication. In 
other words, a mapping ¢: G >G’ is a group homomorphism if G and G’ are 
groups and if for all x, ye G, f(xy) = (x)\(py). 

A group homomorphism ¢:G-—G is called an endomorphism (of the 
group G). The identity mapping |g:G—G of any group G is clearly an 
endomorphism. 

The composition of homomorphisms is again a homomorphism. Indeed, if 
o@:G—>G and : G’>G" are group homomorphisms, then for all x, ye G 
we have 


46 2 Group Theory 


(WOxy) = W(O(xy)) = W(Px)OyY)) = W(Px (Py) = (WO) x)(WO)y), 
and therefore ~~: G—G" is a homomorphism. 


60x. Show that a group homomorphism preserves identity elements and 
inverses. That is, show that de = e’ and that ¢(g~') = (¢g)~! for any homo- 
morphism ¢: G > G’ of groups with identity elements e and e’. 


60B. Let H be a normal subgroup of a group G. Show that the mapping 
o¢:G—>G/H given by ¢g = JH is a group homomorphism. 


607. Show that a group G is abelian if and only if the mapping ¢: G~G 
given by ¢g = g7' is an endomorphism of G. 


605. Show that a group G is abelian if and only if the mapping ¢: G>G 
given by ¢g = g” is an endomorphism of G. 


60e. Show that a group G is abelian if and only if the mapping ¢:G x G>G 
given by ¢(a, b) = ab is a group homomorphism. 


60¢. Determine the number of distinct homomorphisms ¢: Z,,— Z, in 
terms of m and n. 


61. Proposition. A mapping ¢: Z— Z is an endomorphism of the additive 
group of integers if and only if there is an integer k such that $(n) =kn 
for allneé Z. 


Proof. Let ke Z and ¢(n) =kn for all ne Z. Then for all n, me Z we 
have 


o(n +m) =k(n+ m) =kn+km= ¢(n) + O(™m), 
so that ¢ is a group homomorphism from Z to Z. Suppose on the other hand 
that @: Z— Z is an endomorphism. Let k = (1). Sincen = 1 + +::(n)-:- +1 
for a positive integer n, we have 
P(n) = (1) +°°+()°*° + 60) H K+ (+k =k. 
Also, 
p(1) = o(1 + 0) = p(1) + (0) 
implies ¢(0) = 0 = k - 0, from which it follows that 
g(-—1)=—k and ¢(—n) = —kn 


for any positive n. Thus, for all n € Z we have (n) = kn. 
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62. An isomorphism ¢:G-— G’' is a one-to-one correspondence which pre- 
serves group multiplication. An isomorphism is therefore a very special kind 
of homomorphism. If ¢: GG’ is an isomorphism, it is easy to show (as 
the reader should) that the inverse mapping ¢~': G’ > G preserves multipli- 
cation and is again an isomorphism. 

A group G is isomorphic to a group G’ if there exists an isomorphism 
go: GG’. This is denoted by writing G = G’ or ¢: GG’. Isomorphism is 
an equivalence relation among groups: 


(1) Gx G for any group G; 
(2) GG’ implies G’ = G; and 
(3) G=G', GG" imply Gx G". 


Isomorphic groups have the same structure and the same group-theoretic 
properties. In a sense, group theory is the study of those properties of groups 
which are preserved under isomorphism. 


62a. Let R denote the group of all real numbers under addition and let 
R* denote the group of all positive real numbers under multiplication. Show 
that the mapping ¢: R— R®* given by $x =e" is an isomorphism. What is 
the inverse of ¢? 


62B. Let ¢: Z,,— Zj7 be given by $[k],¢ = [3"],,. Show that @ is an 
isomorphism. (See 33 and 34.) 


62y. Show that a group of order 8 is isomorphic to Z, (33), D4, (268), 
Q (461), Z, x Z,, or Z, x Z, x Z,. 


625. Show that a group of order p?, where p is prime, is isomorphic to Zi: 
or Z, x Z,. 


62e. Let G denote the group of real numbers between —1 and +1 under 
the operation x: y =(x+ y)/(1 + xy). Show that G is isomorphic to the 
group of real numbers R under addition. 


62¢. Show that every finite abelian group is isomorphic to a direct product 
of cyclic groups. 


63. Proposition. A group G of order n is cyclic if and only if G = Z,. 


Proof. \f Gis acyclic group of order n(43), then G = {e = a®°,a’,...,a"~"}. 
Recall that Z,, the additive group of integers modulo #, may be represented 
as the set {(0],, [1],,---, [# — 1],} (33). An isomorphism @: G— Z, is given 
by ¢(a*) = [k],, since ¢ is clearly a one-to-one correspondence and 


pla’a’) = pal") = [i + kK) = Lin + (kl, = $(a’) + Ga’). 


48 2 Group Theory 


(This isomorphism is not unique—it depends on the choice of a generator 
for G.) 

On the other hand, suppose G is a group and @¢: Z, > G an isomorphism. 
Let a= $[I],. If g EG, then g = ¢[k], for some [k], € Z,, and therefore 


9 = blk}, = OCU, + °° (A) + [1,) = (PEL) = 


This shows that every element of G is a power of a and hence that G is cyclic. 


64. An automorphism of a group is an isomorphism of the group with itself. 
The identity |g: GG of a group is an automorphism of G, and in general 
there are many others. 

Let a be an element of a group G, and let «,: G— G be the mapping given 
by «,(g) = a~ ‘ga for every g € G. Then a, is a homomorphism since 


a(99') = a” 'gg'a = (a 'ga)(a"'g'a) = (a9) \(.(9'))- 


To prove that «, is an isomorphism, and hence an automorphism, it is 
sufficient to observe that a,-1 is an inverse for «,. Thus, to each element 
ae G there is assigned an automorphism «,. Such automorphisms are called 
inner automorphisms. All other automorphisms (if there are any) are called 
outer automorphisms. 

If S is a subset of a group G and aeéG, then the conjugate of S by a is the 
set 

S* = {xe G|axa' eS} 


(45). It follows that S*=«a,(S), since xeS* means axa~'=yeS, or 


x =a™~'yaea,(S) and vice versa. The point is that we may use this to give a 
variation of the definition of normal subgroup: a subgroup H of a group G is 
normal if «,(H) = H for all ae G. (This is the source of the term invariant 
subgroup for normal subgroups.) 


64a. Show that the set /(G) of all automorphisms of a group G is a group 
under composition and that the set .4(G) of inner automorphisms of a group 
G isa normal subgroup of AG). (The quotient group .(G)/.A(G) is called the 
group of outer automorphisms of G, which is a misnomer because the elements 
of &(G)/.4(G) are not outer automorphisms nor do the outer automorphisms 
themselves form a group.) 


648. Show that the quotient of a group by its center is isomorphic to its 
group of inner automorphisms, that is, G/Zg ~ A(G). 


64y. Show that for any finite group G, o0(G) > 2 implies o(.e(G)) > 1. 


645. Determine the number of distinct automorphisms of the groups Z,, 
and D,,. 
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64e. A subgroup H of a group G is characteristic if ¢(H) =H for all 
¢@ € S(G). Show that K a characteristic subgroup of H and H a charac- 
teristic subgroup of G imply that K is a characteristic subgroup of G. Show 
that K characteristic in H and H normal in G imply K normal in G. 


64¢. Determine the group #(Z, x Z,). 


65. If ¢:G—G’ is a group homomorphism, then the kernel of @ is the set 
Ker @ = {xe G| px =e’ € G’}, 


where e’ is the identity element of G’. In other words the kernel of ¢ is the set 
of elements of G which are mapped into the identity element e’ of G’. Since 
x = xe for any x EG, we have ox = (px)(¢e), which implies ge = e’. (Why?) 
Therefore, e € Ker ¢ and Ker ¢ is not empty. 


Proposition. For any homomorphism of groups ¢: G > G’, Ker ¢ is a normal 
subgroup of G. 


Proof. First, we note that for all xe G, 


$(x~") = G(x)! 
1 = e implies 
(P(x) (Ox ")) = $(e) = e’. 
From this it follows that if x e Ker ¢, then 
d(x!) = (d(x) t= el =e’, 
so that x_* € Ker @. If x, ye Ker @, then 
P(xy) = (Px)\(Py) = ee’ =e’, 


so that xy € Ker ¢. Thus, Ker @ is a subgroup of G. 

Let x e (Ker ¢)’, the conjugate of Ker ¢ by aeG. Then, axa” 'e Ker 
and (¢a)(¢x)(¢a~') = e' which implies dx = e’ and xe Ker @, or in other 
words, (Ker ¢)° < Ker @. On the other hand, if x € Ker ¢, then axa~' € Kero 
and x € (Ker ¢)’, or Ker ¢ c (Ker ¢)’. Thus, Ker ¢ is normal. 


because xx” 


Proposition. A homomorphism ¢: G—G' is one to one if and only if 
Ker ¢ = {e}. 


Proof. \f @ is one to one, then Ker¢@ = @ ‘{e’} can contain but one 
element of G, and since ¢(e) = e’, it must be that Ker ¢ = {e}. Conversely, if 
Ker ¢ = {e} and ¢(a) = ¢(b) for a,b eG, then 


o(ab') = $(a)o(b)' =e’, 
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so that ab~! € Ker ¢, ab™' =e, and a =b. This shows that ¢ is one to one. 
(Why ?) 


A homomorphism which is one to one is called a monomorphism. 


66. The image (11) of a homomorphism ¢: G— G’ is the set 
Im $ = {xe G' |x = G(y), y € G}. 


Proposition. For any homomor phism of groups @: G > G’, Im @ is a subgroup 
of G’. 


The proof is a simple exercise for the reader. 


Proposition. Ifo¢:G-—>G' is a group homomorphism, then 
G/Ker¢@ ~Im@. 


Proof. We recall that the quotient group G/Ker ¢ is the set of left cosets 
a(Ker d) with the product a(Ker ¢)b(Ker ¢) = (ab)(Ker ¢). (See 47.) Let 
B: G/Ker ¢-Im @ be the mapping given by f(a(Ker ¢)) = d(a). We must 
first check that B is well defined: a given element a(Ker ¢) of G/Ker ¢@ may be 
written in many ways, but the definition of £ is given in terms of a specific 
expression. Suppose that a(Ker ¢) = b(Ker ¢). Then bea(Ker@) and 
b = ac, where cé Ker @. It then follows that 


(5) = b(ac) = $(a)P(c) = Pale’ = H(@). 


Thus, the value of 8 on a coset does not depend on the specific representation 
of the coset—it is the same for all representations. 

It is clear from the definition of f that it is a homomorphism, and it only 
remains to see that B is a one-to-one correspondence. If xeIm @, then 
x = $(y) for some ye G, and x = B(y(Ker ¢)). Therefore, B is onto. On the 
other hand, it is immediate from the formula for B that Ker B is the set with 
the single element, Ker @ (considered as the coset of e). Therefore, B is one 
to one. 


67. A homomorphism ¢: G— G' is onto if and only if Im ¢@=G' (11). A 
homomorphism which is onto is called an epimorphism. 


Proposition. Every group homomorphism @¢:G—G' can be factored in the 
form $ = aBy, where 
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(1) a: Im ¢ >G’ is a monomorphism, 
(2) B: G/Ker ¢ > Im @ is an isomorphism, 
(3) y: G> G/Ker ¢ is an epimorphism. 


Proof. The mapping a is simply the inclusion of the subset Im ¢ into G’: 
it assigns to the element x e Im @, the same element x € G’; a is clearly one to 
one, a monomorphism. The mapping f is the isomorphism discussed in 66. 
The mapping y is given by y(a) = a(Ker ¢)—that is, y assigns to the element 
aeéG, the coset a(Ker ¢), which belongs to G/Ker ¢. Since every coset of 
Ker @¢ is the coset of each of its elements, the mapping y is onto. It is clear that 
y is a homomorphism: 


y(ab) = (ab)(Ker ¢) = (a(Ker $))(b(Ker $)) = (ya)(y2). 
Thus, y is an epimorphism. 


We can summarize the content of this proposition in Figure 4. 


? , 
G G 
OY a 
G/Ker ¢ ee 
B 
Figure 4 


67a. Let R denote the group of real numbers under addition and C* the 
group of nonzero complex numbers under multiplication. Decompose the 
homomorphism ¢: R > C* given by ¢x = e?"* in the manner of the proposi- 
tion above. 


67B. For any group G let ¢: G > W(G) be given by og = a, -: (64). Decom- 
pose @ in the manner of the proposition above. 


67y. Let d: Z— Z,, be the homomorphism given by ¢(k) = [nk],,. Decom- 
pose @ as above. 


675. Decompose the endomorphism ¢: K > K of the circle group K given 
by $z = 2". 
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68. The First Isomorphism Theorem. If H and N are normal subgroups of a 
group G, and N cH, then H/N is a normal subgroup of G/N and there is 
an isomor phism 


(G/N)/(H/N) = G/H. 


Proof. We define a mapping ¢: G/N > G/H by ¢(aN) = aH. Clearly ¢ is 
well defined; furthermore ¢ is a homomorphism, since 


d(aNbN) = g(abN) = abH = (aH)(bH) = g(aN)g(oN). 
Now 
Ker ¢ = {aN € G/N | ¢(aN) = H} = {aN €G/N|aeH}=A/N. 


Therefore by 65, H/N is a normal subgroup of G/N. Furthermore, ¢ is onto 
(why ?), and by 66, 


(G/N)/(H/N) = (G/N)/Ker ¢ = Im ¢ = G/H. 


69. The Second Isomorphism Theorem. If H and N are subgroups of a group 
G, and N is normal in G, then there is an isomorphism of groups, 


HNN ~H/(H ON). 


Proof. Tacit in the statement of this theorem are the statements: 
(1) HN is a normal subgroup of H; (2) HN is a subgroup of G; and 
(3) N is anormal subgroup of HN. We leave the proofs of these statements to 
the reader. (466—46c.) 

We define a mapping ¢: HN/N > H/(H aN) by 


(AN) = h(H 0 N). 


(Note that AnN = hN—that is, for any element Ane HN, the coset modulo 
N of hn is the same as that of h.) We must verify that 


(1) @ is well defined: suppose hN =h'N; then hk’ =hn for ne N, and 
n=h"'h'€ HAN; and consequently, h(H 1 N)=h'(H ON). 

(2) @ is onto: for any h(H 7 N)€ H/(H 7 N), we have h(H J N) = G(AN). 

(3) @ is one to one: Ker @ is the set of cosets hNe HN/N for which 
h(H AN)=HAQAN, which occurs only when hE HANCWN and 
AN=N. 


Remark. Since HN = NH, we also have NH/N = H/(H c N). 
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70. The Third Isomor phism Theorem. (Zassenhaus.) If H, and H, are 
subgroups of a group G and if N, and N, are normal subgroups of H, and 
H,, respectively, then there are isomorphisms 


N,(H, 0 H2) H, OH, — N2(A1 9 A2) 
N(H, ON.) (Hy ON2)(N,O Hz) NN, 7 A) 


Proof. Because of symmetry, only the first isomorphism need be proved. 
We leave it to the reader to verify that the requisite subgroups are normal. 
(See exercise 46v.) Setting H = H, 0 H, and N=N,(H, AN,), we apply 69 
to obtain isomorphisms 


N,(H, 0 N2)(H; OH) NH | a, H, fi, 
N,(H, 0 N,) ~ N HAN H,OH,0N,(H,ON,) 


Exercise 350 states that if X, Y, and Z are subgroups of a group and Yc X, 
then X \ YZ = Y(X CZ). This implies: 
(1) N,(H,y 0 N2)(A, 0 H2) = N,(H, 9 Ap); 
(take X = H,, Y=H, AN,,Z=H),); 
(2) Hy 0 H,0N(A, 0 N2) = (Hy ON2)(N, 9 A); 
fake X =H, HH, Y>H,0N>2,Z =N,)- 


The proof is now complete. 
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71. A normal series for a finite group G is a sequence of subgroups of G, 
ie 0, @ OC, a — Ge =O, 


such that G;_, is a proper normal subgroup of G; for i= 1, 2,...,. The 
factors of a normal series are the quotient groups G,/Gp, G2/G,,..., G,/G,_,. 
A refinement of a normal series is a normal series which contains all the 
subgroups of the original normal series (and perhaps more). A refinement 
which is not identical with the original series is called a proper refinement. 


72. The Schreier-Zassenhaus Theorem. Two normal series for a finite group 
have refinements of equal length whose factor groups are isomorphic. 
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Proof. Suppose that 
{fe} = Gp eG, c-''cG,=G (1) 
and 
{e} =H o CH, c*-CH,=G (2) 


are two normal series for the finite group G. We form a new series of sub- 
groups of G, 


(e} = Gy €G; e -* aig ae. (3) 


by setting G, = G,(G,4, © H,) fork =qm +r, whereO0 <q <nand0<r<m. 
Note that 


Gam = G(Gi41 0 Ho) = Gi_(G, A -H,,) =e 


Thus, we see that G, is well defined and that each group of the original series 
(1) occurs in series (3). Furthermore, each group G, is clearly a normal sub- 
group of its successor G, , ,. However, series (3) need not be a normal series— 
we may have G, = G,,, for some values of k. 

Similarly, we form another new series of subgroups of G, 


{é} = Aye Ayo & Hing ie (4) 


by setting A, = AA g+,9G,) for k=qn+r, where 0<q<m and 
0 <r<n. Remarks similar to those about series (3) apply to series (4). 

Now we see that series (3) and (4) have isomorphic factors. In fact for 
k =um+vand/=vn + u, we have by (70) 


Gea Gi Gig OH) Ao Gee Ay 


eee = — eden ented: OO pebeeeeiet  —— ——d 


G, 7 GAG4.0 Ea - AUG OGe 7 A, ' 


Finally, we obtain refinements of the normal series (1) and (2) by eliminating 
the redundancies from series (3) and (4). Since (3) and (4) have isomorphic 
factors, it follows that they have the same number of redundancies. Conse- 
quently, the refinements of (1) and (2) obtained from them have the same 
length and isomorphic factors. 


73. A composition series for a finite group is a normal series which has no 
proper refinements. The following theorem is an almost immediate conse- 
quence of the Schreier-Zassenhaus theorem. 


The Jordan-Holder Theorem. Two composition series for a finite group have 
the same length and isomorphic factors. 
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Proof. By the preceding theorem two composition series have refinements 
of equal length with isomorphic factors. However, the refinements must be 
identical with the original series, which by hypothesis have no proper 
refinements. 


73a. Show that every finite group actually has a composition series. 


73B. Define the /ength I(G) of a finite group G to be the length (number of 
factors) of a composition series for G. Show that H< G implies that 


1(G) = I(H) + (G/H). 


73y. Let G be an abelian group of order n = p}'p}? --: p,* where each p; is 
prime. Show that ((G) =v, +v, + -"' + %. 


74. A group is simple if it has for normal subgroups only itself and the 
trivial group. For example, groups of prime order are necessarily simple, and 
there are many others. Our interest here in simple groups is due to the 
following result. 


Proposition. A normal series is a composition series if and only if each factor 
group is a simple group. 


Proof. Suppose that G is a finite group and that 
{e}=G)oG,c::'cG,=G (*) 


is a normal series for G. If (*) is not a composition series, then we can obtain 
a proper refinement of (*) by inserting a new group G’ into the series at some 
point, say G, < G’cG,,,. It follows that G’/G, is a normal subgroup of 
G,4,/G, and that G,,,/G, is not a simple group. On the other hand, if 
G,,+;/G, is not simple for some k, then there is a normal subgroup G’, 


{e} 26" — Gy 44/G,. 


It follows that G” = G'/G,, where G,< G’ <G,,,, and G’ is a normal 
subgroup of G,,,. (Why?) Then (*) has a proper refinement and is not a 
composition series. 


Corollary. A group whose order is a power of a prime p has a composition 
series in which each factor is cyclic of order p. 


This is immediate from (52) and the proposition above. 


74a. Construct a composition series for the dihedral group Dg, (356). 
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74B. Construct composition series for the groups Z,, D,, and Q (461). 


75. Solvable Groups. A finite groupis solvable if it has a composition series 
in which each factor is a cyclic group. Since these factors must also be simple 
groups (74), they must all have prime order. Solvable groups are connected 
with the solvability of equations in radicals (139-149), which explains the 
unexpected terminology. We already know that every group of prime power 
order is solvable. We shall use the following results in Chapter 4. 


Theorem. Let H be a proper normal subgroup of a group G. Then G is 
solvable if and only if H and G/H are solvable. 


Proof. The normal series {e} c H < G may be refined to a composition 
series for G, 


{ep —CociG, <=: +-SG ae 
Suppose that G, = H. Then composition series for H and G/H are given by 
{e} =G)oG,c::-cG,=H 
and 
{e} = G,/H < Gya1/H c++ © G,/H = G/H. 
Furthermore, by the first isomorphism theorem (68) we have for i > k, 
(Gj41/H)/(G,/H) © G;44/G;. 


Thus, each factor of the composition series for G is a factor of either the 
composition series for H or that for G/H. Now the theorem follows imme- 
diately. 


Corollary. Every finite abelian group is solvable. 


Proof. By induction on the order. The solvability of groups of orders 1, 2, 
and 3 is clear. As induction hypothesis we assume the solvability of all abelian 
groups with order below n. Let G be an abelian group of order n. There is 
some prime p which divides n. By Cauchy’s theorem (55), G has an element 
of order p. Since G is abelian, this element generates a normal subgroup H 
of order p. If p=n and H =G, we are finished because a group of prime 
order is clearly solvable. If p <n, then H and G/H (which has order n/p) are 
solvable and applying the theorem, G is solvable. This completes the induction 
step and the proof of the corollary. 
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75a. Prove that any subgroup of a solvable group is solvable. 


75B. Prove that a group is solvable if it has a normal series whose factor 
groups are solvable. 


75y. Prove that a group with order below 60 is solvable. (See 59n.) 
758. Prove that a direct product of solvable groups is solvable. 


75e. Prove that a group is solvable which has just one p-Sylow subgroup 
for each prime p dividing its order. 


The Symmetric Groups 


The symmetric groups (30) are of such great importance in the Galois 
theory that we make a special study of their properties. 


76. Let X denote a finite set. A permutation of X is a one-to-one onto map- 
ping from X to X. The set »(X) of all permutations of X is a group in a 
natural way: if S, T € #(X), then ST € x(X) is the composite mapping, given 
by (ST)x = S(Tx) for xe X; the inverse of Se .(X) is just the inverse 
mapping S~!. A subgroup of .(X) will be called a group of permutations 
of X. 

A permutation group is a special kind of transformation group (53). If G 
is a group of permutations of the finite set X, then the action of G on X is 
given by g * x = g(x). This action satisfies: 


(1) g * (A * x) = (gh) * x for all g, hE G and all xe X; 
(2) ex«x=x forall xeX; 
(3) ifg*x =x for all xe X, then g =e. 


Only conditions (1) and (2) are required of transformation groups in general. 
An action of G on X which satisfies (3) is called effective, or alternatively, G 
is said to act effectively. It is clear that we could have made the definition: a 
permutation group is a group which acts effectively on a finite set. (See 53y.) 


77. The structure of the full group of permutations /(X) of a finite set X 
is completely determined by the number of elements of X. To be precise, a one- 
to-one correspondence ¢: X — Y induces an isomorphism ®: #(X) + A(Y) 
given by ®(S) = ¢Sd™. Suppose now that X is a set with n elements 
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and that w: X +N, is a one-to-one correspondence of X with the set 
N, ={1,2,..., 2}. (We might call w an ordering of X.) Then w: X -N, 
induces an isomorphism Q: (X)— &N,). However, »(N,) is simply 
S,, the symmetric group on n letters, defined in 30. In other words we have: 
if X is aset with exactly n elements, then s(X) is isomorphic to S,, the sym- 
metric group on n letters. 

As a consequence, the study of permutation groups is reduced to the study 
of the symmetric groups and their subgroups. Every finite group can be 
viewed as a permutation group of its own set of elements and, consequently, is 
isomorphic with a subgroup of S, where n is the order of the group. These 
observations indicate the significance of the symmetric groups. 


77a. Show that every group may be considered as a group of permutations 
of its underlying set. (This is known as Cayley’s theorem after the English 
mathematician Arthur Cayley (1821-1895), who was the first to consider 
abstract groups.) 


78. Let z: N,N, be anelement of S,, and let 2, denote z(k). One way 
of expressing the permutation 7 is by a tableau 


t= 5 
a Se 
Clearly the order of elements in the top row is immaterial. Note that the 
inverse of z is 
=|. (Ra oy “Oe 
oem is 2 Bahay ) 


This notation, used by Cauchy in his early studies of permutation groups, is 
needlessly complex. Each element of N, appears twice, and no advantage is 
taken of the order in which the elements are written. We shall develop a 
more efficient notation in which every permutation is written as a product of 
““cycles” in a unique way. 

The tableau notation makes it clear that the order of S, is n!. The element 
m, may be chosen in n ways; once 7, is chosen, there are n — | possibilities 
for m,; when 7, 2,, ..., 7; have been chosen, there remain n — i possible 
ways to choose 7;,,. Thus, there are n-(n — 1)-+-: 2-1 =n! ways in which 
the bottom row of the tableau may be chosen, and thus there are n! elements 
ofS, - 


78a. Determine the number of permutations z which leave no element 
fixed, that is, for which x, #k for k =1,2,...,n. (This is a famous, but 
difficult, problem which was first solved by Nicolas Bernoulli (1687-1759), and 
later, independently, by Euler.) 
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79, Let a, a2,...,4€N, be distinct integers. We shall denote by 
(a,,a,,..., a,) the permutation 

a, a> eee a, S58 i) less 

a, a, ee a, eee a eee 
which carries a, to a,, a, toa3,..., and a, to a,, leaving all the other elements 
of N,, fixed. We call (a,,a,,...,a,) a cyclic permutation of order k or a 
k-cycle. This notation is almost too efficient: (a,, a2, ..., @,) can denote an 


element of any one of the groups S, for which n> k. 

A cyclic permutation of order 2, (a,, a2), simply interchanges a, and a, 
and is called a transposition. 

Two cyclic permutations (a,,a,,...,a,) and (b,,b,,..., 5) are disjoint 
if they have no entries in common. Disjoint cyclic permutations commute, 
that is 


Meera 06, 6), 6 129 O)= (0905 ..- FO a}; Bs, --- 5 G): 


However the groups S, are not abelian for n > 2. 


79a. Compute the number of distinct k-cycles in S,,. 


79B. Show that if 2eS, and n>2, then there exists a transposition t 
such that tz 4 mt (unless, of course, z is the identity element of S,). This 
shows that S, has trivial center for n > 2. 


797. Prove that disjoint cyclic permutations commute. 
795. Show that S, contains (2) subgroups isomorphic to S, x S,-,, all of 


which are conjugates. (sec 20y for definition of the binomial coefficient (;) 


80. Theorem. Every permutation of n letters is the product of disjoint cyclic 
permutations in exactly one way (except for order of the factors). 


Proof. LetneS,,. Weshall denote by H the cyclic subgroup of S, generated 
by z. H acts on the set N, = {1, 2, 3,..., n} dividing it into disjoint orbits, 
X,,X2,..-.,X,. In other words, two elements i and j of N, belong to the 
same orbit if and only if j = n‘(i) for some power n* of x. In any orbit X, we 
may list elements in order 


Ay, Ayr, ++ +5 As,» 


so that a 5+; = 7(Q;) and a,, = 1(a,s,). (Why is this possible?) We let «, 
denote the cyclic permutation (a,1, Q2,-.+,Qs,). We claim that m = a,a,...a,. 
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To prove this we need only show that z and a,a, --- a, have the same effect 
on any element xEN,,. If xe X,, then a(x) = x for i# k and a,(x) = n(x). 
Therefore, 


(H1%2 °** &)(x) = % (x) = n(x). 
Finally, the expression z = «,a, ‘+: a, is clearly unique except for the order 


of the a;’s. 


Note that we may include or exclude factors of the form a, =(m) since 
every 1-cycle is the identity. 

In practice it is a simple matter to express a permutation as the product of 
disjoint cyclic permutations. For example 


i 2a 4 Od 
(; OLN a2 i748 3)(2, 6)(4, 7). 


Corollary. If a,,02,...,4,€S, are disjoint cyclic permutations, then the 
order-of a0, °°: 4, is the least common multiple of the orders of the factors. 


Proof. Let k; denote the order of «;, and let k be the least common 
multiple of the k;. Since the «;’s commute, we have 


(Gay ~'4)*= vioh ee, Ne 


so that o(a,a,-::a,) divides k. Since the «; are disjoint, it follows that 
(a,a, ++: a,)'=e implies «; =e for each i. Then k;|/ for each i, and thus 
k|/. In particular k | o(a,0, -*: a), and therefore o(a,a, -*: a.) =k. 


Corollary. Every permutation is a product of transpositions. 


Proof. It is enough to show that every cyclic permutation is a product of 
transpositions. This is easy because 


(ay, ,..., A) = (@,, a)(@,, a1) *** (@y, a2). 


800. Let eS, be written as a product «,a, --:a, of disjoint cycles, where 
for convenience we assume that 


o(a) = o(a) 2° = o(a,). 


We shall call the decreasing sequence of natural numbers o(@,), o(a,), ..., 
o(a,,) the form of x. Show that two elements of S, are conjugate if and only 
if they have the same form. 


80B. Compute the number of elements which have the form k,, k,,..., Km. 
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80y. Show that S, is generated by the transpositions 
2) 25 Sage ney Ue— "T, ). 
805. Show that S, is generated by the cycles (1, 2) and (1, 2,..., 7). 


81. Even and Odd Permutations. Let #, denote the polynomial of n 
variables x,, x,,..., x, which is the product of all the factors x; — x; with 
i <j; that is, 
PA Xugts, -»-5X,) = al (x; — x)). 
i<j 
The symmetric group S, acts on the polynomial A, by permuting the variables. 
For zé S, we have 


PA Xnit) ’ Xn(2) 2) tos) Xa) = (sgn TI a X25 +205 Xa), 


where sgn x = +1. If the sign is positive, then z is called an even permutation; 
if the sign is negative, then z is called an odd permutation. (Sgn is an abbrevia- 
tion of the Latin word signum.) 

It is not difficult to see that sgn(xo) = (sgn n)(sgn a). This means that: 


the product of two even or two odd permutations is even, 
the product of an even and an odd permutation is odd. 


It follows that the set of even permutations is a subgroup of S,, called the 
alternating group on n letters and customarily denoted A,. We may regard 
sgn: S, + K,asa homomorphism from S, to K, = {+1}, the group of square 
roots of unity (44). This shows that A, is a normal subgroup of S, and the 
quotient group S,/A, is isomorphic to K,. 

It is immediate that o(A,) = n!/2. 


8la. Determine the sign of a k-cycle in terms of k. 


82. Proposition. Every even permutation of n letters, n> 3, is the product 
of cyclic permutations of order 3. 


Proof. There are no cyclic permutations of order 3 in S, for n =1 or 
n = 2, but the identity is always an even permutation. Thus, the proposition 
is false for n < 3. Our proof will be inductive, beginning with n = 3. 

The even permutations of 3 letters are the identity e and the 3-cycles 
(1, 2, 3) and (1, 3, 2). Thus, we have easily disposed of the case n = 3. 

Now suppose the proposition proved for even permutations of less than n 
letters, and suppose z € A,,. The permutation o = (z,, ,/)+ 7, where x; =n, 
satisfies 


a(n) =(x,, 7, 1) + n(n) =(x,, 7", i)(n,) =7N 
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and is even. Since a leaves n fixed and is even, o may be considered as an 
even permutation of the letters 1, 2,...,—1. By the inductive hypothesis 
o is the product of 3-cycles, say o = a,a, ‘+: a,. Setting %) =(z,, i, n), we 
have 


Ay HA °° Ao = yoo = (Tn, L n)(t,; nh, i) “N=, 
and we have expressed z as a product of 3-cycles. 


82a. Show that the alternating group A, is generated by the 3-cycles (1, #, n) 
OPT SB, By.ccoy tt = Ile 


83. Theorem. The alternating group A, is simple except for n = 4. 


Proof. Recall that a group is simple if it has only itself and the trivial 
group as normal subgroups. For n < 4 the order of A, is either 1 or 3, and 
A, is obviously simple. The major part of the proof is the case n > 4. 

Let N be a nontrivial normal subgroup of A, for n > 4. We must show that 
N =A,,. The first step is to see that N contains a 3-cycle. 

Let « # e be an element of N which leaves fixed as many elements of N,, as 
possible. As guaranteed by 89, let 


C—O ou eee 


where the «; are disjoint cycles, which we may assume are given in order of 
decreasing length. Renumbering if necessary, we may assume that 


Se= (le 2h hs He 
and, when s > 1, that 
a, =(kK+1,k+2,...,/). 


We distinguish several cases. 

Case 1. « moves each of the numbers 1, 2,3, 4, 5. (This occurs when s > 2, 
when s=2 and «=(1,2,...,k)(kK+1,k +2,...,/) with />4, or when 
s=1anda=a, =(1,2,...,k) for k > 4.) Setting B = (3, 4, 5), the element 
B-'a-*B belongs to the normal subgroup N, and thus B~ ‘a7 'fa e N. How- 
ever, it is easily checked that the permutation B~ '«~ 1Ba leaves the number 1 
fixed in addition to leaving fixed all the elements fixed by a. This contradicts 
the choice of a, and case | is impossible. 

Case 2. a moves the numbers 1, 2, 3, 4 and no others. (This occurs only when 
a = (1, 23, 4), since (1,2,3,4) is an odd permutation.) Again we set 
B = (3, 4, 5) and argue that the element B~'«~ Ba belongs to N. However, 
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direct computation shows that B~1a~'Ba = (3, 4, 5) = B. Thus, Be N and B 
moves fewer elements than a. This contradiction eliminates case 2. 

Case 3. « moves the numbers 1, 2,3 and no others. (This occurs only when 
a =(1,2,3).) There are no other cases now that the first and second are 
eliminated. Thus, we have shown that N contains a 3-cycle, which we may 
assume to be (1, 2, 3). 

It remains to show that N contains every 3-cycle. Choose an even permu- 


tation 
Ne i jig ovolf® 
Then, o(1, 2, 3)o~* = (i, j, kK) belongs to the normal subgroup N. Varying i, /, 


and k, we obtain all 3-cycles. Thus, N contains every 3-cycle, and in view of 
82, N =A,, and we are finished. 


Remark. The group A, is not simple: it contains a normal subgroup of 
order 4 containing the elements e, (1, 2)(3, 4), (1, 3)(2,4), and (1, 4)(2, 3). 


83a. Verify that the set 
N => {e, el, 2G, 4), (i, 3)(2, 4), di, 4)(2, 3)} 


is a normal subgroup of A,. Show that K = {e, (1, 2)(3, 4)} is normal in N 
but not in A,. (This shows that a normal subgroup of a normal subgroup need 
not be normal in the whole group.) 


838. Show that A, has no subgroup of order 6. 


84. Theorem. For n> 4, the symmetric group S, is not solvable. 


Proof. Since the groups K, ~ S,/A, and A, (for n > 4) are simple, the 
normal series 


{e} CA, CS, 


is a composition series for S when n > 4. However, A, is not abelian for 
n> 3. (For example, (1, 2, 3)(2, 3, 4) = (1, 2)(3, 4) while (2, 3, 4)(1, 2, 3) = 
(1, 3)(2, 4).) Consequently, A, is not cyclic for n > 3. As a result S, is not 
solvable for n > 4. 


84a. Construct composition series for the groups S,,S,, and S,, and 
verify that these groups are solvable. 


848. Show that for n> 4 the only normal subgroups of S, are {e}, A,, and 
S,, itself. 
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85. We have already remarked that a finite group can be viewed as a 
permutation group of its set of elements. We can improve this as follows. 


Theorem. If H is a subgroup of a finite group G and H contains no nontrivial 
normal subgroup of G, then G is isomorphic to a subgroup of (G/H), the 
group of permutations of the set G/H. 


Proof. Define a homomorphism ¢: G > x(G/H) by setting 
&(9XxH) =(gx)H all xeG. 


Ker ¢ is a normal subgroup of G (65). An element g belongs to Ker @ if and 
only if (gx)H = xH for all x € G, or what is the same thing, x~ ‘gx e€ H for all 
x €G. In other words, Ker ¢ is the intersection of H and all its conjugates. 
Thus, Ker ¢ CH and by hypothesis Ker ¢@ must be trivial. It follows that 
Gz Im ¢. 


Corollary. Forn > 4, A, is the only proper subgroup of index less thanninS,,. 


Proof. It follows from 846 that for n > 4, A, is the only proper, nontrivial, 
normal subgroup of S,,. Suppose that H is a subgroup of S, and [S,: H] <x. 
If [S,: H] = 2, then H is normal and H = A, On the other hand [S,: H] > 2 
implies A, ¢ H. Thus, the hypothesis of the theorem is satisfied, and S, is 
isomorphic to a subgroup of &(S,/H). However, 


of (S,/H)) = [S,: A]! < n! = ofS,), 


which is a contradiction. 


85a. Let f(x;, x,,...,x,) be a function of n variables, n > 4. Let v denote 
the number of distinct functions obtained when the variables x,, x.,..., X, 
are permuted. Show that v > 2 implies v > n. (The general problem of what 
can be said about the number v is classic and one of the motivating ideas 
for group theory. Results of this type were given by many early group 
theorists, including LaGrange, Ruffini, Abel, Cauchy, and Galois.) 


86. A subgroup H of S, is transitive if for every pair of elements i, 7 EN, 
there is an element 7€H such that x(i) =/. For example, the cycle « = 
(1, 2,...,) generates a transitive subgroup of S,: the element o/~! carries i 
to 7. The following theorem about transitive subgroups will be needed to 
establish unsolvability of quintic equations in general. 


Theorem. Let H be a transitive subgroup of S, where p is a prime number. If 
H contains a transposition, then H = S,. 
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Proof. We may assume without loss of generality that (1, 2) is the trans- 
position H contains. An equivalence relation on the set N, = {1, 2, va eh 
is defined by i ~ / if and only if the transposition (i, 7) € H. From the transi- 
tivity of H it follows that each equivalence class has the same number of 
elements; in fact, if ¢ ¢ H and ¢, = ¢(1) =i, then ¢ yields a one-to-one cor- 
respondence from the equivalence class of | to that of i since (1,°k) € H if 
and only if 


Gi, bx) = (1, Ox) = 6 (1, K) OE. 


The number s of elements in any equivalence class must divide the prime p, 
and thus s = | or s = p. However, the equivalence class of | contains at least 
the two elements | and 2. Consequently, there can be only one equivalence 
class which has p elements. In other words, H contains all the transpositions 
of S,. Since every permutation is a product of transpositions (80), we have 
H=S,. 


Field Theory 


Chapter 3 


A field is an algebraic structure in which the four rational operations, addi- 
tion, subtraction, multiplication, and division, can be performed and in 
which these operations satisfy most of the familiar rules of arithmetical 
operations with numbers. In the formal definition of field structure, we assume 
only that addition and multiplication are given; subtraction and division are 
defined as the inverse operations. Division by 0 is automatically prohibited by 
the definition. 

Field theory is the theoretical background for the theory of equations. It 
does not make sense to ask, for example, whether the equation x? + x + 1 =0 
is solvable, without specifying the field in which we want thesolutions tolie. 
If we specify the field to be the set R of all real numbers, then the equation 
x? + x +1=0 has no solutions, which is to say, there are no real numbers 
satisfying this equation. On the other hand there are complex numbers (the 
cube roots of unity, @ and w”) which do satisfy this equation in the field C of 
all complex numbers. 

From an abstract viewpoint the theory of equations is just the study of 
field theory. In this chapter we present the basic field theory which is needed for 
Galois theory in the next chapter. To illustrate the depth of field theory, we 
take up the ancient problem of constructibility of geometric figures with 
straightedge and compass and prove that, in general, angles are not trisect- 
able in this way. 


66 


Definition and Examples of Field Structure 67 


Definition and 
sxamples of Field Structure 


87. A field is a set F with two operations (called addition and multiplication) 
which assign to each ordered pair (a, b) of elements from F, two elements of F, 
called their sum, a + b, and their product, ab, in such a way that: 


(1) Fis an abelian group under addition (with identity element 0); 

(2) F*, the set of nonzero elements of F, is an abelian group under multi- 
plication; 

(3) multiplication is distributive over addition; that is, for any three 
elements a, b, ce F, 


a(b + c) =ab+ac and (a + b)c = ac + be. 


As customary in the additive notation for abelian groups, we shall denote 
the additive inverse of ae F by —a. If ae F*, then a has a multiplicative 
inverse as well, and we denote this by a~‘ or I/a. We always denote the multi- 
plicative identity element (identity element of the group F*) by 1. 


87a. Show that 0a = 0 = a0 for any element a of a field F. 
87B. Show that (—1)a = —a for any element a of a field F. 


877. Letaandb be elements of a field F such that ab = 0. Show that a = 0 
or b =0. 


875. Ifaand b are elements of a field Fand b ¥ 0, let a/b denote ab~'. Show 
that when a # 0, |/(a/b) = b/a. For a, ce F and b, de F* prove the rule for 
addition of fractions: 


87s. Construct a field with four elements. 


876. Let F be a field and let E = F x F. Define addition and multiplication 
in E by the rules: 


(a, b) + (c,d) =(a+c,b+d) 
and 
(a, b\(c, d) = (ac — bd, ad + bc). 


Determine conditions on -F under which E (with these operations) is a field. 
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87n. A mapping ¢: FE of fields is a field homomorphism if it preserves 
addition and multiplication—that is, 


p(a + b)= (ha) + (pb) and ¢(ab) = (pa)($b). 


Show that a field homomorphism is always one to one or trivial (every element 
mapped to zero). Explain why an onto field homomorphism is a field isomor- 
phism. (Field isomorphism has the obvious meaning.) 


88. The set C of all complex numbers is a field under the usual rules for 
addition and multiplication of complex numbers. Similarly, the set R of all 
real numbers and the set Q of all rational numbers are fields. 

A subfield of a field E is a subset Fsuch that (1) Fis a subgroup of E under 
addition and (2) F* = F — {0} is a subgroup of £* under multiplication. Of 
course a subfield of a field is itself a field. 

Clearly, Q and R are subfields of C, and Q is a subfield of R. We shall use 
the term number field to designate subfields of C. Thus, Q and R are number 
fields. Before long we shall have many other examples of number fields—in 
fact they will be the main source of examples and applications for field theory. 


88a. Show that a subset F of a field E is a subfield if and only if F* is non- 
empty and a, b € F implies a — b € F and (when b #0) a/be F. 
88B. Show that the set 

Q(,/2) = {ze C|z =a + b,/2; a, bE Q} 


is a number field. 
88y. Show that the set 

Q(i) = {ze C|z=a+ bi; a, be Q} 
is a number field. 


885. Prove that every number field contains Q. 


89. The Prime Fields Z, . In 34 we observed that for p a prime number, the 
elements of Z,, except [0],, form a group Z, under multiplication. Clearly, 
Z, is abelian. Furthermore, we have 
[a],((5], + [c],) = (a],[b + c], = [a(b + c)], = [ab + ac], 
= [ab], + [ac], = [a],[b], + [],[cl,, 
which shows that multiplication is distributive over addition in Z, . Therefore, 
when p is prime Z, is a field. 
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89a. Let F bea field. Ifae FandneN, we let na denote the element of F 
obtained by adding a to itself 7 times. Thus, 2a =a+ a, 3a=a+a+a,and 
so forth. We say that a field F has characteristic 0 if forae Fandne N,na=0 
implies a= 0. If F does not have characteristic 0, then na =0 for some 
aeé F* and some vn EN. In this case we define the characteristic of F (denoted 
| char F) to be the smallest natural number » such that na = 0 for someae F*. 
Show that char F = n implies that na = 0 for a/l ae F*, and that » is prime. 


89B. Show that a field F has characteristic 0 if and only if there exists a 
one-to-one field homomorphism ¢: Q — F; show also that char F = p if and 
only if there exists a one-to-one field homomorphism ¢: Z, > F. 


| 89y. Show that for a field F of nonzero characteristic p the mapping @: F> F 
given by ¢a = a? is a field homomorphism. Show that ¢ is an isomorphism 
when F is finite. 
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ases, and Dimension 


90. A vector space over a field F is an additive abelian group £, together 
with an operation (called scalar multiplication) which assigns to each element 
céF (called a scalar) and each element we E (called a vector) an element 
cae E in such a way that: 


(1) c(do) = (cd)x for any three elements c, de F and ae E; 
(2) (c+ d)a = ca + da for any elements c, dé F and we E; 
(3) cia +B) =ca + cf for any elements ce F and a, fe E; 
(4) la =a for any element oe E. 


The study of vector spaces is called /inear algebra. The next few articles will 
discuss some elementary concepts of linear algebra. 


91. Suppose that E is a vector space over F. A finite set of clements of E£, 


A = {4%,,%,---,%m}, 18 linearly dependent over F if there is a linear relation, 
C)H%y Hep hy + °° + C,, 0, = 0, (*) 
with coefficients c,,C2,..., ¢, € F, not all zero. If there is no relation of the 


form (*) among the x,;’s except the trivial one (in which all coefficients c; are 
zero), then the set 4 is /inearly independent over F. 
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We extend the notion of linear independence to infinite sets as follows: an 
infinite subset of E is linearly independent over F if each one of its finite 
subsets is linearly independent over F. 


92. Suppose that E is a vector space over F. A set S of elements of E is a 
spanning set for E over Fif every element of E can be written as a linear com- 
bination, 


C\O, + C€,02 + 2 AG. Ope 


of elements ¢,, ¢,,..., 0, € S and coefficients c,, c,,..., ¢, € F. 
If there exists a finite spanning set for E over F, then E is called a finite 
dimensional vector space over F. 


92a. Let S be any subset of a vector space E over F. Show that the set of 
vectors E’ of E which can be written as linear combinations of vectors in S is a 
vector space over F. (E’ is said to be a subspace of E. See 926.) 


92B. A subset E’ of a vector space E over Fis a subspace of E’ if every linear 
combination of vectors in E’ belongs to E’. Show that a subspace of a vector 
space is again a vector space (over the same field). 


92y. A linear transformation from a vector space E over a field F to a vector 
space E’ over Fis a mapping T: E > E’ such that 


T(a + B) = (Ta) + (7B) and T(ca) = c(Tx) 
for all vectors w and B and every scalar c in F. Show that the sets 
Kery = {ae E|Ta=0} and Im7= {a €£ |e = iaige 


are subspaces of E and E’, respectively. 


93. Again suppose that E is a vector space over F. A basis for E over Fis a 
minimal spanning set for E over F. Explicitly, a set B of elements of E is a 
basis for E over F if 


(1) Bis a spanning set for E over F, 
(2) no proper subset of B spans E over F. 


Proposition. A basis is linearly independent. 


Proof. Let {B,, B,,..., B,} be a finite subset of a basis B for E over F. 
We shall see that if {8,, 6,,..., B,} were linearly dependent, B could not 
be minimal. Suppose that 


CB, + C2 B22 +°°' + CB, =0 (*) 
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were a linear relation among the f;’s with c,, c.,..., c, € F, not all zero. We 
may suppose, without loss of generality, that c, #0. Then it would follow 


from («) that 
n= ()a-G)a—- Ee 


in other words, that f, is a linear combination of the elements B,, B;,..., B, 
with coefficients in F. It is not hard to see that since B spans E over F, 
the set B — {B,} would also span E over F. However, this would show that B 
does not satisfy condition (2) and is not a basis. Therefore it must be true that 
every finite subset of B is linearly independent and that B itself is linearly 
independent. 


Proposition. A linearly independent spanning set is a basis. 


Proof. Suppose that B is a linearly independent spanning set for E over 
F. If B is not a basis, then some proper subset S of B also spans E 
over F. Choose Be B—S. Since S spans E over F, there are elements 
G,, G5, --.,0,,€.5 and coefficients c,, cy, .-., C_ € F such that 


= C705 SP G55) ae FOSS CSG 


However, this implies that the set {8, 0,, 02, ..., 0,,}, Which is a finite subset 
of B, is linearly dependent over F. This contradicts the linear independence of 
B. Thus, B must be a basis. 


94. Proposition. If B isa basis for E over F, then every element of E may be 
written uniquely as a linear combination of elements of B with coefficients in F. 


Proof. We have only to prove uniqueness since B spans E over F. Suppose 
that B e Ecan be written in two ways as a linear combination of elements of B, 
say 


P= Ba + C2 Po he": + 6, Be, 
B=4,B, + d,B, +°::+4,B,, 
where we assume without loss of generality that the same elements, B,, B2,..., 


B,, of B are involved in the linear combinations. Now subtracting the two 
expressions above, we have 


(c; — d)B, + (c2 — da)B, + °° + (c, — 4,)B, = 9. 


Linear independence of the set {8,,82,...,8,} implies that c;=d; for 
i=], 2,..., a. Uniqueness is proved. 
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95. Proposition. If E is a finite dimensional vector space over F, then every 
basis for E over F is finite and all such bases have the same number of elements. 


Proof. Since E is finite dimensional over F (92), there is a finite spanning 
set for E over F from which a finite basis can be selected. (How?) Let B= 
{B1, B2,---» B,} be such a finite basis, and let A be any other basis. 

Choose an element a, € A. Let a, be written as 


Oy = Cy fy Co fie + °° 7: G8. c,€ F. (1) 


Since a, is a basis element and therefore nonzero, not all the coefficients c; 
are zero. Without loss of generality we may suppose that c, #0. Now we 
claim that the set B, = {a,, B,,..., B,}, in which a, has replaced £,, is again 
a basis for E over F. Since 


By = ey a — 2B. —**' — Bn (2) 


every element of E can be written first in terms of the basis elements B,, B, , 
..., 6,3; then the right hand side of (2) can be substituted for B, yielding an 
expression in terms of the elements «,, 8,,..., B,. This shows that B, spans 
E over F. It remains to show that B, is linearly independent. Suppose now 
that 


da, + d,B,+-:'+4,B, =0, d,eé F, (3) 


is a linear relation among the elements of B,. Substituting (1) into (3) gives a 
relation among the elements of B: 


(4yc1)B, + (dy + dycr)B. +++ + (Gi + de,)B, = 9. (4) 


The linear independence of B implies that the coefficients in (4) are all zero, 
from which we conclude that d, = 0 (since c, #0 by assumption above), 
and consequently, that d, = d, = -:- =d,=0. We have shown that B, isa 
linearly independent spanning set for E over F and thus a basis by (93). 

Next choose an element «, € A which is not a scalar multiple of a,. (If no 
such a, exists, it must be that nm = 1 and the argument is finished.) We can 
write 


Ot, =cya, +28, + °°: +0,8,, c,e€ F. 


Since «, is linearly independent of a, not all the coefficients c,, ¢3,..., c, are 
zero. Without loss of generality, we may assume that c, # 0. Now we claim 
that the set B, = {a,, a,, B3,..., B,} is a basis for E over F, which is proved 
by an argument like that above. 
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Continuing in this fashion, we arrive at a basis B, = {a,,@,,...,a,} made 
up entirely of elements from A. Since A is a basis and minimal, it follows that 
a. — A. 


The number of elements in a basis for E over F is called the dimension of E 
over F and is denoted [E: F']. 


95a. Prove that a subspace E’ of a finite dimensional vector space E over F is 
again finite dimensional and that [E’:F] < [E: F]. 


95p. Let E’ bea subspace of a vector space E over F. An equivalence relation 
on E is defined by « = B mod E’ if and only if a — B € E’, and we denote the 
quotient set of this equivalence relation by E/E’. Show that E/E’ is a vector 
space over F. Show that E finite dimensional implies that E/E’ is finite dimen- 
sional. 


95y. With the same hypothesis as in 95B, show that when E is finite dimen- 
sional the dimension of E is the sum of the dimensions of E’ and E/E’. 


Extension Fields 


96. A field E is called an extension (field) of a field F if F is a subfield of E. 
This additional terminology seems superfluous, and technically it is. It reveals, 
however, a difference in modes of thought between field theory and group 
theory. In group theory we are often interested in determining the sub- 
structure of a group whereas in field theory we are more interested in what 
superstructures a field can support. Frequently we shall extend a field by 
adjoining to it additional elements. 


Proposition. An extension field E of a field F is a vector space over F. 


Proof. Clearly, E is an abelian group under addition. Scalar multiplication 
of an element c € F and an element a E is defined as the product ca where 
both c and a are considered as elements of E. Now the four properties required 
of scalar multiplication (90) are immediate consequences of E being a field. 


When an extension field E of a field F is a finite dimensional vector space 
over F, we shall refer to E as a finite extension of F and to the dimension 
[E: F] as the degree of E over F. 
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Finally, we note that a sequence of extension fields 
be he ee 
is called a tower of fields, and Fo is called the ground field. 


96a. Show that the degree of Q(,/2) over Q is 2 (88). 
96B. Show that [C: R] =2. 


96y. Let w = e?*/3 = —4 + 4,/—3, Show that the set 
Q(w) = {ze C|z=a+ bw; a,beEQ} 
is an extension field of Q of degree 2. 


965. Show that a finite field (that is, a field with a finite number of elements) 
of characteristic p (89a) has p” elements for some n. 


97. Proposition. If D is a finite extension of E and E is a finite extension of F, 
then D is a finite extension of F. Furthermore, 


[D: F] = [D: EJ[E: F]. 


Proof. Let A = {a,,%2,...,@,} be a basis for E over F, and let B= 
{B,, B2,--+, By} be a basis for D over E. We shall show that the set 


C= {a,8,|lsism,l<j<n} 


is a basis for D over F. 
(1) C spans D over F. Suppose y € D. Using the basis B, we write 


Vie MPa ty 2ih ot: noe ae 
Each of the elements y; € E, i= 1, 2,..., , can be written as 
Yi Cis che Cia tea el Cole Gi OF 
Substituting these expressions into the one above yields 
n m 
tp : pan a; B;, Cy er. 
— j= 


(2) Cis linearly independent. Suppose that there is a linear relation among 
the elements of C with coefficients in F, 


Me: 


m 
yi c;%; 8; = 0, Ci ea. 
1 j=1 


i 
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We may regard this as a linear relation among the elements of B with coeffi- 
cients y; = )7=, ¢;;%;, which must be zero since B is a basis and linearly 
independent. On the other hand, the linear relations 


imply that all the coefficients c;; are zero for 
Nee in ae) 


We have shown that C is a linearly independent spanning set for D over F; 
hence it is a basis. Since C has a finite number of elements, it follows that D isa 
finite extension of F. Finally, we have 


[D: F] =nm = (D: E)[E: F). 


98. A polynomial over a field F in the indeterminate x is an expression of the 
form 


Paar cer i ges 


where Co, C;,-.., C, are elements of F, called coefficients of the polynomial. 
Polynomials are completely determined by their coefficients, which is to say, 
two polynomials over F in x are equal if their corresponding coefficients are 
equal. 

The phrase “‘an expression of the form” in a mathematical definition is 
hardly consonant with modern standards of rigor, and we shall eventually 
give a more precise treatment of polynomials (156). 

We shall usually denote polynomials by a single letter such as f and write 
an equation such as 


Ie = £6 Gk + et 


to specify the coefficients. Then, /(2), f(x’), f(y + 1), etc., will indicate the 
corresponding expressions in which 2, x?, y + 1, etc. have been substituted 
for x. The largest number k for which c, # 0 is called the degree of f (denoted 
deg /), and c, is called the /eading coefficient of f. If all the coefficients of fare 
zero, we write f= 0 and do not assign a degree tof. 
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Polynomials are added and multiplied just as in elementary algebra, and we 
have 


deg(f + g) < max {deg f, deg g}, 
deg( fg) = deg f + deg g, 


whenever the polynomials involved are nonzero. 

If f is a polynomial over F in x and ae F, then fa is an element of F. The 
assignment a— fa defines a function F > F, which we shall denote / It may 
happen, however, that two distinct polynomials define the same function. For 
example, the polynomials x and x? over Z, have this property. (Why ?) 

F[x] will denote the set of all polynomials in x over F. A polynomial of the 
form fx = is called a constant polynomial and will sometimes be identified 
with the corresponding element ce F. In this way we may view F as a subset 
of F[x]. 


98a. A rational function over a field F in the indeterminate x is a (formal) 
quotient p/q of polynomials p and g over F. Two such quotients, p/q and r/s, 
are equal if and only if ps = qr in F[x]. We denote the set of all rational func- 
tions of x over F by F(x). We identify a rational function p/1 (where | is the 
constant polynomial fx = 1) with the polynomial p. Thus, F[x] is identified 
with a subset of F(x). (Sometimes polynomials are called integral functions.) 
Show that F(x) is a field under the operations defined by 


aL Lo 
me ae G) \s} | ae 


99. The Division Theorem for Polynomials. If f and g are polynomials over F 
and g #0, then there exist over F unique polynomials q and r such that 
f=qg+-r and either r=0 or deg r < deg g. 


Proof. Let R denote the set of all polynomials over F which have the form 
f—4g for some polynomial q over F. If R contains 0, the polynomial with 
all coefficients zero, we set r=0 =f—qg, and we are finished except for 
uniqueness. 

Suppose’then 0 ¢ R. Then the set 


S={neN|n=degh,heR} 


is nonempty since either deg fe S or, when / = 0, deg g € S. Therefore, S has a 
smallest element m. By definition of S we have m = deg r for some re€ R, and 
by definition of R we have r = f— qg for some q. In other words, f= qg + r, 
and it remains to show that deg r < deg g. Suppose that m = degr = degg =n. 
Clearly, there is an element c € F such that the polynomial s given by 


EE eeeeoeoeeoer———=—_—_ a 
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sx = rx — cx" "(gx) = fx — [(ax) + (cx"~")I(gx) 


has degree m — | or less. However, s € Rand this contradicts the minimality of 
m. It must be that deg r < deg g. 

Suppose q’ and r’ are polynomials satisfying the same conditions as q and r. 
Then f=qg +r=q'g +r’ implies 


qQ-q))g=r'—r. 


If g —q' # 0, then taking degrees on both sides of this last equation we must 
have 


deg(q — q') + deg g = max {deg r’, deg r}. 


This implies that either deg g < degr’ or degg < degr, both of which are 
wrong. Thus, we must have g =q’ and, consequently, r=r’. This proves 
uniqueness. 


In practice it is not difficult to determine q and r: we simply carry out the 
customary long division of f by g obtaining qg as the quotient and r as the 
remainder. If r = 0, then we say that g divides f and we write g | f. 


Corollary. (The Remainder Theorem) If f isa polynomial over the field F and 
a is an element of F, then there is a unique polynomial q over F such that 


fx = (x — a(qx) + (fa). 
Proof. Applying the division theorem with g given by gx = x — a, we have 


Sx = (x — a)(qx) + (rx) 


where r = 0 or deg r < deg g = 1. Thus, rx is a constant, and taking x = a 
shows that rx = fa. 


99a. Indicate the changes needed in the proof of the theorem above to prove 
the following: if f and g are polynomials with integer coefficients and g is 
monic, then there are unique polynomials q and r with integer coefficients such 
that f = qg + r where either r = 0 or deg r < deg g. (A polynomial is monic if 
the leading coefficient is 1.) 


100. An element « of the field Fis a root of the polynomial f over F if fa = 0. 
In other words, a is a root of fif f, considered as a function, assigns the 
value 0 to «. 

The remainder theorem just proved implies: if x is a root of f, then fx = 
(x — «)(gx), or in other words, (x — a) divides f. 
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Proposition. A polynomial of degree n over the field F has at most n roots in F. 


Proof. The proof is by induction on n. To start the induction, we note that 
a polynomial of degree 0 is a (nonzero) constant and has no roots. Now 
suppose the proposition is true for polynomials of degree less than n. Let fbea 
polynomial of degree n. If f has no roots in F, then we are finished. If fhas a 
root « € F, then /x = (x — a)(qx) for some polynomial q over F of degree n — 1. 
Then q has at most n — | roots in F; it clearly follows that fcan have at most 
n roots since a root of f is either a or a root of q. 


A polynomial of degree n over the field F which-has all n roots in F is said 
to split over F. Clearly, a polynomial f of degree n splits over F if and only if it 
can be factored as 


Fx = c(% — a) — Og) +++ (% — Oy) 


where ,, &,..., &, € F are the roots of f- 


Theorem. The multiplicative group F* of a finite field F is cyclic. 


Proof. Since F* is abelian, each of its Sylow subgroups is normal, and 
therefore, for a prime dividing the order of F*, there is just one Sylow 
subgroup. It follows from 590 that F* is the direct product of its Sylow 
subgroups. Furthermore, the orders of the Sylow subgroups are relatively 
prime, and therefore, it follows from 43y that F* is cyclic if each Sylow 
subgroup is. To see this, let H denote the p-Sylow subgroup for a prime p 
dividing the order of F*, and let « be an element of maximal order, say p*, in 
H. Then the order of every element of H must divide p*. From the preceding 
proposition it follows that there cannot be more than p* elements of F 
satisfying the equation x™ = 1, and thus o(H) = p* = o(a), from which we 
conclude that H is cyclic. 

Exercise 100e suggests an alternate proof of this theorem. 


100a. The formal derivative of the polynomial fx = co + cyx +++: +," is 
the polynomial f’x = c, + 2c,x +--+ +nc,x"~'. Verify the rules of formal 
differentiation: (f+ 9)’ =f’ +g’ and (fg) =f'9 + fg’. 


1008. Show that a polynomial / over a field F and its derivative f’ have a 
common root « in F if and only if « is a multiple root of f, that is, (x — «)? 
divides f. 


100y. Show that there exists one and only one polynomial of degree n or less 
over a field F which assumes n + 1 prescribed values fay = Bo, fa, = fi, ...; 
fa, = B, where a, &,,..., %, are distinct elements of F. (The expression for fis 
called the Lagrange interpolation formula.) 


1005. Show that every element of a finite field with q elements is a root of the 
polynomial x? — x. 
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100e. By counting the number of elements of F which are roots of x” — | for 
various values of n, show that the multiplicative group F* of a finite field F is 
cyclic. (See 25a.) 


100¢. For pa prime construct a group isomorphism Z,_, > Z,. 


100n. For p a prime show that (p — 1)! = —1 mod p. (This is known as 
Wilson’s theorem after Sir John Wilson (1741-1793), who was a student of 
Edward Waring (1736-1798). The statement, but not the proof, of this theorem 
first appeared in Waring’s Meditationes Algebraicae of 1770 (p. 218). The first 
published proof is due to Lagrange in 1771. Lagrange also proved the con- 
verse: if (7 — 1)! = —1 mod 4, then n is prime.) 


1000. Show that a polynomial fx = cg + cyx + +++ +.¢,x" over Z has a root 
pilq € Q, where p, q € Z and (p,q) = 1, only if p|co and q|c,. 


1001. Let fbe a polynomial over a field F whose derivative (100«) is 0. Show 
that if char F = 0, then f is a constant polynomial. What can one say in the 
case where char F #0? 


101. In general a polynomial of degree n over a field may have any number 
of roots from 0 to x in that field. A notable exception to this occurs for the 
field of complex numbers C. 


The Fundamental Theorem of Algebra. A polynomial of positive degree over 
the field C of complex numbers has a root in C. 


Proof. Unfortunately, all proofs of this theorem use analysis and therefore 
are not really algebraic. We shall give a proof due to Ankeny which uses the 
theorem of Cauchy from complex function theory. (This proof is included for 
completeness only and may be skipped by the reader unfamiliar with complex 
function theory.) 

Let f be a polynomial of degree n > | over C given by 


Ge 6 Cie ye 


We let f denote the polynomial of degree n over C whose coefficients are the 
complex conjugates of those of /; that is, 


fret, $62 f** +e, 


Now the product ¢ = ff is a polynomial of degree 2” over C with real coeffi- 
cients. (Why?) We observe that it is sufficient to prove that ¢ has a root: if 
gu = (fx)( fx) =0, then either fx = 0 (and f has x as a root) or fx = 0 and 
fa=0. 
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Suppose ¢ has no root in C. Then the complex function 1/@ is analytic in 
the whole complex plane. It follows from Cauchy’s theorem that the integral 
of 1/¢ along any path in the plane depends only upon the endpoints of the 
path. In particular, the integral of 1/@ around the upper half (I) of the circle 
|z| = R in a clockwise direction equals the integral of 1/¢ along the real axis 
from —R to R. 


Figure 5 


We examine the behavior of these integrals as R grows large. Since 
deg @ = 2n, we can write $(z) = az?" — (z) where a # 0 and y is a polynomial 
of degree less than 2n. Consequently, we have 


o@)|,,_ |¥@] 


az2" az2" 


Suppose now that W(z) = ay + ayz +++: + a,,2” where m < 2n. Then 


W(z) 


az" 


ec l4ol + lai Izl + + °° + lal 121" 
r |a| |z|?" 
(laol + lay] + +++ + lanl) 
[a \z|?7>" 


at least when |z| > 1. It follows that for any ¢ € (0,1), there exists some R, > | 
such that |z| > R, implies |W(z)/az?"| < e, and also 


|p(z)| = laz?"\(1 — e) = JalR?"(1 — 8). 
We apply this to J, dz/¢(z) for R= R, to get 


dz 
$@)| 


|dz| <{ |dz| 1 
rld(z)l~ Jr lalR*"(1 — 2) jalR?"-*(1 — 8) 


Thus, as R grows large, J, dz/¢(z) grows small in absolute value. 
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Now consider the other integral. Since ¢ has real coefficients, it takes only 
real values along the real axis. Furthermore, ¢ cannot change sign along the 
real axis—to do so it would have to vanish somewhere, contrary to the hypo- 
thesis that ¢@ has no roots. It follows that the integral of 1/d along the x axis, 
which can be expressed as 


+R dx 
roy 


can only increase in absolute value as R grows. Of course this behavior is 
completely opposite to that of the integral around [ and provides the con- 
tradiction which establishes the theorem. 


Corollary. A polynomial f of degree n over C has n roots in C and factors as 
Fx = c(% — a )(% — a) +++ (X — &), 
where a,, %2,..., &, are the roots of fandc éC*. 


Proof. By the theorem, f has a root a, €e C. Then fx = (x — «,)(gx) where 
g is a polynomial of degree n — 1. Again, g has a root a,¢€C and gx = 
(x — «2)(hx), and so forth. 


N.B. The roots a,,a,,..., 4, need not be distinct. A number which occurs 
more than once in the list of roots is called a repeated or multiple root. Those 
which occur once are called simple roots. 

The fundamental theorem of algebra was stated first in 1746 by the French- 
man Jean-le-Rond D’Alembert (1717-1783), who gave an incomplete proof. 
The first true proof was given by Karl Friedrich Gauss (1777-1855) in 1799. 
Gauss gave, in all, four distinct proofs of this theorem. 

A field with the property that every polynomial over it splits into linear 
factors is said to be algebraically closed. The fundamental theorem may there- 
fore be restated as: the field of complex numbers is algebraically closed. It is 
true, but difficult to prove, that every field is contained in an algebraically closed 
field. 


10la. Show that every polynomial over R of positive degree can be factored 
as a product of polynomials over R with degree | or 2. 


101B. A number « is a root of multiplicity m of a polynomial ¢ over C if 
(x-—a)"|\ox but (x-—a)"*' Pox. 


Show that a is a root of ¢ of multiplicity m if and only if da = ¢’a =::: = 
go" Ya = 0, but 6°a 4 0. (Here ™ denotes the k-th derivative of ¢.) 
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102. A polynomial g over the field F divides a polynomial f over F if f=qg 
for some polynomial q over F. To indicate that g divides f, we write g|/, and 
to indicate that it does not, gf. A polynomial is always divisible by itself and 
by every polynomial of degree 0. 

A polynomial f over F of positive degree which can be factored as f= gh 
where g and / are polynomials over F of positive degree is called reducible over 
F; a polynomial of positive degree which cannot be thus factored is called 
irreducible over F. (We shall not apply either term to polynomials of degree 
zero.) Every polynomial of degree | is irreducible. In general there are many 
irreducible polynomials of higher degrees over a field. As we shall see, irre- 
ducible polynomials are like prime numbers. 


Proposition. A polynomial f, irreducible over the field F, has a root in F if and 
only if deg f = 1. 


Proof. If deg f= 1, then fx = co +.c,x and fhas —co/c, as root in F. On 
the other hand, if f has a root ae F, then fx = (x — a)(qx) for some poly- 
nomial q over F. Since f is irreducible, it must be that deg g = 0 and, con- 
sequently, deg f= 1. 


Corollary. The only irreducible polynomials over the field of complex numbers 
C are those of degree |. 


As an example we note that the polynomial x? + 1 over the field of rational 
numbers Q is irreducible, but considered as a polynomial over C it is reducible: 
x? + 1=(x—i\(x +i). 


102a. Show that a polynomial irreducible over R has degree 1 or 2. 


102B. Showthatevery polynomial of positive degree over a field F is divisible 
by a polynomial irreducible over F. 


1027. Show that thereare an infinite number of irreducible polynomials over 
any field. 


1025. Compute the number of irreducible polynomials of degrees 1, 2, and 3 
over Z,. 


102e. Determine all of the monic polynomials (that is, polynomials with 
leading coefficient 1) of degrees 2 and 3 which are irreducible over Z;3. 


102¢. Show that the polynomial f over a field is irreducible if and only if the 
polynomial g defined by gx = f(x + a) is irreducible over the same field. 


102n. Show that 4x° — 3x — 1/2 is irreducible over Q. 


1020. An integer m is called a quadratic residue mod p if and only if the con- 
gruence x? = m mod p has a solution, or what is the same thing, if and only if 
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the polynomial x? — [m], has a root in the field Z,. Count the number of 
elements [m], € Z, for which x* — [m], has a root in Z,. 


1021. Show that x? — a has a root in Z, (p > 2) if and only if a?*!/? =a, 


102k. Determine conditions on a and b for which the quadratic equation 
x? + ax + b =0 is solvable in Z,. 


103. A greatest common divisor of two polynomials of positive degree over 
the field F is a polynomial of maximal degree over F dividing both. That is, 
dis a greatest common divisor of f and g if d|f and d|g, but deg h> deg d 
implies either h } f or h¥g. For a rather trivial reason, there is more than one 
polynomial which satisfies these requirements: if d is a greatest common 
divisor of f and g over F and cé€ F*, then the polynomial cd (given by (cd)x = 
c(dx)) is also a greatest common divisor. We shall let (/, g) denote the set of 
polynomials which are greatest common divisors of fand g over F. Outside of 
this aspect of the situation, the notion of greatest common divisor for poly- 
nomials is similar to that for integers (23). 


Theorem. If f and g are polynomials of positive degree over F and deé(f,Q), 
then there exist polynomials u and v over F such that 


d=uf+ vg. 


Proof. Let a denote the set of polynomials of the form sf+ tg over F. Let 
b= {ne N|n=degh, hea,h #0}. 


The set b contains deg f and deg g and therefore has a smallest element 
m = deg d’, where d’ = u'f + v'g Ea for some polynomials wu’ and v’ over F. 
We must have d’|/A for all h Ea. Otherwise, for some h, h = qd' + r where 
deg r < deg d’, and if h = sf + tg, we have 


r=h—qd' =(s —qu')f+(t—qv')g 


so that r € a, contradicting the minimality of deg d’ in b. Thus, d’ divides every 
element of a—in particular d’ |f and d’|g. Therefore, deg d’ < deg d. On the 
other hand, d|f and d|g so that d divides d’ = u'f + v’g and deg d < deg d’. 
Consequently, deg d = deg d’. However, d|d’ implies d’ = cd where ce F. 
Setting u = u'/c and v = u'/c, we have d= d'/c = uf + vg. 


Corollary. If d, d'e(f,g), then d' = cd for some nonzero cé F. 


Corollary. Iff,g, and hare polynomials over the field F, if f is irreducible over 
F, and if f divides gh, then f divides g, or f divides h (or both). 
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Proof. If fg, then | € (f, g) and by the theorem there exist polynomials 
u and v over F such that 1 = uf+ vg. Then h = ufh + vgh, and therefore f| h. 


103a. Formulate and prove for polynomials over a field F an analogue of 
the euclidean algorithm. (See 23¢.) 


1038. Let F[x] denote the set of polynomials in x over the field F..Given a 
polynomial q € F [x], we define an equivalence relation on F [x], called con- 
gruence modulo q, by f= g mod q if and only if q|(/— g). We shall denote the 
equivalence class of fe F [x] under this relation by [/], and the quotient set of 


F[x] by F[x]/(q). Show that the operations of addition and multiplication 
defined on F'[x]/(q) by 


(flt+tg,=U+9), (hla), = la, 


are well defined. Prove that they define a field structure on F [x]/(q) if and only 
if q is irreducible over F. 


103y. Letgq be a polynomial irreducible over the field F and let E denote the 
field F [x]/(q) (1038). Show how E may be viewed as an extension of F, and 
show that [E: F] = deg q. Show thatg, considered as a polynomial over E, has 
a root in E. 


1038. Show that the field R[x]/(x? + 1) is isomorphic to the field of com- 
plex numbers C. 


103. Show that the field Q[x]/(x? — 2) is isomorphic to the field Q(,/2). 


103¢. Deduce Kronecker’s theorem: For every polynomial f over a field F 
there exists an extension field E of finite degree over F in which f splits. (Use 
1028, 103B, and 103y.) 

Leopold Kronecker (1823-1891) is renowned for his work in quadratic 
forms and ideal theory. He is also famous for his remark, “‘ Die ganzen 
Zahlen hat der liebe Gott gemacht, alles anderes ist Menschenwerk.” (‘“‘ The 
whole numbers God has made, all else is man’s doing.’’) 


103n. Construct a field with p? elements where p is a prime, p > 2. 


1030. Prove that over a field of characteristic p (p #0) the polynomial 
x? — wis irreducible or splits. 


104. Unique Factorization Theorem for Polynomials. Every polynomial over 
a field can be factored as a product of irreducible polynomials in a way 
which is unique except for the order and multiplication of factors by constants. 


We shall omit the proof of this theorem, which is entirely similar to the 
proof of the fundamental theorem of arithmetic (24) and which is a special 
case of the proof of unique factorization in euclidean domains to be given 
later (172). 
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How a polynomial is factored as a product of irreducible polynomials 
depends of course upon the field with which it is associated. For example, 
x* + | is irreducible over Q and has no proper factorization over Q. However, 


over Q(,/2) (88B) we have 
x? EH? + / 2x + 1)? Se Fo), 
while over Q(i) (88y) we have 
Re cde? aKa: = 4): 


Furthermore, a polynomial over a field may be factored in several ways. For 
example, over Q we have 


x8 4 xt 4 x? + 1 = (x? + 1x4 +:21) = (2x* +: 2)4x? + 4). 


105. Let f be a polynomial over the field Q of rational numbers given by 
fx = Cy +0,x +++: +0,x". We shall call f primitive if 


(1) f#9, 
(2) the coefficients cy, c,, ..., C, are all integers, 
(3) the greatest common divisor of cg, c,,.--, C, iS 1. 


Proposition. The product of primitive polynomials is again a primitive poly- 
nomial. 


Proof. Let g and h be primitive polynomials given by 


GX = ay +a,X4+°°-+4,x 
and 
hx = by + Oyx 4+ °°+ + b,x. 


Suppose there is a prime p dividing all the coefficients of gh. Since g is primi- 
tive, p cannot divide all the coefficients of g. Let a; be the first coefficient of g 
not divisible by p. Similarly, let b; be the first coefficient of h not divisible by p. 
The coefficient of x'*/ in the polynomial gh is given by 


A554; + ge te Q;-,Dj44 ate a;b; ae Q;415;-, rem at Qj 4; D9. 


Since p divides ay, a,,..., @;-, and by, by, ..., bj, every term to the left 
and right of a; 5; is divisible by p. By hypothesis p divides the whole expression, 
and it follows that p divides a;b;. However, p|a;5,; implies p|a; or p|b,, 
which is a contradiction. Thus, no prime divides all the coefficients of gh. In 
other words, the coefficients of gh have greatest common divisor 1. 
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106. Proposition. Every nonzero polynomial f over the field Q of rational 
numbers can be written uniquely as f = cf where ce Q, c>0, and f is a 
primitive polynomial. 


The positive rational number c is called the content of f, and the polynomial 
f is called the primitive form of f. 


Proof. Clearly, f= af’ where ae Q, a>0, and f’ is a polynomial with 
integral coefficients. Let f be the polynomial obtained from’ by dividing each 
coefficient by the number b EN, which is the greatest common divisor of them 
all. Then f = cf where c = ab. 

Suppose f can be written in two ways: f = cf = dg where f and g are primi- 
tive. Let c= p/q and d=r/s where p, q, r, sé N. Then spf =qrg is a poly- 
nomial with integral coefficients having greatest common divisor sp = qr. It 
follows that c = p/q = r/s = d and, consequently, that f = g. 


Corollary. A nonzero polynomial with integral coefficients is reducible over 
Q if and only if it factors as a product of two polynomials with integral 
coefficients of positive degree. 


Proof. Suppose f is a polynomial over Z which is reducible over Q, say 
f=gh. Let g = ag and h= bh be the factorizations of g and A guaranteed by 
the proposition. Then f = (ab)gh is a factorization of f since gh is primitive. 
Therefore, ab is the content of f and therefore an integer. We have f= 
((ab)g)h, the required factorization. The argument in the other direction is 
trivial. 


This corollary is called Gauss’s lemma because it is given in article 42 of his 
famous Disquisitiones Arithmeticae of 1801. The proposition of 105 is also 
called Gauss’s lemma by some authors. 


107. The Eisenstein Irreducibility Criterion. Let f be a polynomial over Q 
with integral coefficients, say fx=Cyo+cyx+-+::+0c,x". If there is a 
prime number p such that p divides every coefficient of f except c, and 
p” does not divide cy, then f is irreducible over Q. 


Proof. Suppose that fis reducible. Then by the corollary of 106, f must 
factor as f = gh where g and / are polynomials of positive degree with integer 
coefficients. Let g and h be given by 


gGX=agtaxteo+ax 
and 


ix = Do + b,x + e+ DEX, r+s=nN,. 
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The coefficients are related by the equations 
Co = Ay bo, 


Cy = aod, + abo, 


Gap On +a,b,-4 + Saas + a,bo. 


By hypothesis, p|c and therefore p|ay or p|b,, but not both since p? / cy. 
Without loss of generality, we may assume that p|d) and p}by. Now p|c, 
and p| dp imply p|a,bo; since p ¥ by, it follows that p|a,. Continuing in this 
fashion, we obtain p| da ),p|a,,...,p|a,. Thus, p divides every coefficient of g. 
Since f = gh, it follows that p divides every coefficient of f. But this contra- 
dicts the hypothesis that p }c,. Thus, f cannot be reducible, and the proof is 
complete. 


Ferdinand Gotthold Max Eisenstein (1823-1852) was a student of Gauss 
and continued the work of the master begun in the Disquisitiones Arithmeticae. 
The theorem above appeared in Crelle’s Journal fiir Mathematik vol. 39 (1850), 
pp. 160-179. It is sometimes erroneously attributed to Theodor Sch6nemann 
(1812-1868). 
107a. Prove that there exist a countable number of irreducible polynomials 
of degree n over Q. 

1078. Show that the polynomial ®,x =1+4+x+--+-+x?~! is irreducible 
over Q for p a prime. (Hint: consider gx = ®,(x + 1).) 
1077. By means of the Eisenstein criterion, show that the cubic 4x* — 3x 
—1/2 is irreducible over Q. 
1075. Show that a polynomial of odd degree 2m + | over Z, 

2m+1 


Ue Meg t 1X + °° + Com 4 1X > 


is irreducible if there exists a prime p such that 


(1) PN Caner 

(2) P| Cmpt EP \\Chs-23'6 iP | Cams 
ORD ese) a ae a oe 
(4) le oe 


(This is a theorem of Eugen Netto (1846-1919) and appears in Mathema- 
tische Annalen, vol. 48 (1897).) 


107e. Let 
PX = (fox) + (Axx +°°° +(x) xX" 
be a polynomial in X over the field F(x) (98a) with coefficients in F[x]. Sup- 


pose that x divides fox, /,x,...,fn-,x but not f,x, and that x? does not 
divide fy x. Prove that @X is irreducible over F(x). 
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Algebraic Extensions 


108. Let E be an extension field of the field F. An element « of E is algebraic 
over F if « is aroot of some polynomial with coefficients in F. If every element 
of E is algebraic over F, then E is called an algebraic extension of F. 

As examples we note that v2 and i= ve 1 are algebraic over Q. Complex 
numbers which are algebraic over the rational field Q are called algebraic 
numbers. There exist complex numbers which are not algebraic (e and z for 
example) and these are called transcendental numbers. 


108«. Prove that the sum, c + a, and product, ce, of a rational number c and 
an algebraic number «@ are algebraic numbers. 


108B. Prove that cos(kz) is an algebraic number whenever k is rational. 


109. Let « be an element of the extension field E of the field F, and suppose « 
is algebraic over F. Among all the polynomials over F of which @ is a root, let 
f be one of lowest degree. Then fis called a minimal polynomial for a over F. 
Minimal polynomials have two important properties. 


Proposition. If fis a minimal polynomial for « over F, then 


(1) fis irreducible over F, 
(2) f divides any polynomial over F having « as a root. 


Proof. Suppose / is reducible, say f = gh. Then we have fa = (ga)(ha) = 0, 
which implies gz = 0 or ha =0. Both g and h have degree less than f, 
contradicting the definition of f as a minimal polynomial for a Thus, / is 
irreducible. 

Suppose « is a root of a polynomial g over F. By the division theorem we 
can write g=qf+r. Then we have ga = (qa)(fx) + re =0 which implies 
ra = 0. If r #0, then deg r < deg f. But then r is a polynomial of degree less 
than f with « as a root, contradicting the minimality of f, Thus, r = 0 and 


fg. 
Corollary. Two minimal polynomials for « over F differ by a constant factor. 


109a. Let f be a polynomial irreducible over F, and let E be an extension 
field of F in which f has a root «. Show that fis a minimal polynomial for « 
over F. 


1098. Let Fo Ec D bea tower of fields. Let « € D, and let g be a minimal 
polynomial for « over E and fa minimal polynomial for « over F. Show that 
g\f (considering both as polynomials over E). 


Algebraic Extensions 89 


109y. Find minimal polynomials over Q and Q(,/2) for the numbers 


,/2 + /3 and i,/2 = /—2. 


110. Let « be an element of £, an extension field of F. We denote by F(«) the 
smallest subfield of E containing both F and «. F(a) is called the field obtained 
by adjoining « to F. We may also characterize F(a) as the intersection of all the 
subfields of E which contain « and F. 


Proposition. If E is an extension field of F and « € E is algebraic over F, then 
F(a) is a finite extension of F of degree n wheren is the degree of a minimal 
polynomial for « over F. Furthermore, the set {1, a, «?,..., «"~'} is a basis 
for F(«) over F. 


Proof. Since F(«) is a field and contains «, it must contain all the elements 
1,«,a7,..., 0"! and therefore, as a vector space over F, it must contain every 
linear combination 


Cot cate +c,-0"7 3, 


with coefficients in F. Let X denote the set of all such linear combinations. It 
is not difficult to see that XY is a vector space over F spanned by 


Wye we ple 


Now we assert that {l,a,...,«"~'} is linearly independent over F. If 
there were a nontrivial linear relation over F, 


Cot cya t+ ++: +c,-,0""' =0, 
then « would be a root of the polynomial g over F given by 
Gee cy Fx t+ yt T!. 


However, deg g <n, and by hypothesis n is the degree of a minimal poly- 
nomial for a over F. This contradiction forces the conclusion that 


{L, &ey..e" *} 


is linearly independent and hence that it is a basis for X over F. 

The remainder of the proof consists of showing that X is a field. Since X 
contains F and «, this implies that F(a) c XY. We already know that X c F(a), 
so that we will have X = F(a). 

Clearly, X is an additive subgroup of E. To show that X is a subfield of E, 
and hence a field, we need only verify that ¥* = X — {0} is a multiplicative 
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subgroup of E* = E — {0}. Let f be a minimal polynomial for « over F. Sup- 
pose that 


P= bo. + bioete O eae 
and 


y=Cort cat?" Pele 


are elements of X*. Wecan write B = ga and y = ha for the polynomials g and 
h over F given by 


gx = bo + b,x + “a + b,, aoe 
and 


hie = Co 4n€yX Oak 


By the division theorem we have gh = q f+ rwherer = Oordegr < degf=n. 
Since fa = 0, we have 


0 # By = (ga)(he) = (gh)a = (qf)o + re = ra. 
Since ra # 0, we have r # 0, and consequently, deg r < n. Thus, 
By =ra=adgtaat-:'+a,-,0" *6 X*. 
Finally, we show that every element of X* has a multiplicative inverse. Let 
B=ge=bo + bat:::+,_,a" ! 


as above. By 109, the minimal polynomial f for « over F is irreducible. There- 
fore, 1 € (f, g) the greatest common divisor, and by 103 there exist polyno- 
mials uw and v over F such that uf + vg = 1. Moreover, we can find uw and v 
so that deg v < deg f= n. Since fa = 0, we obtain (va)(ga) = 1. Thus, B™* = 
vae X*. 


As an example of this proposition, consider the field Q(¢) where ¢ = e?"!/? 
for p prime. Now ¢ is ap-th root of unity, that is, a root of x? — 1, and there- 
fore is algebraic over Q. We have the factorization 


xP —1] =(x — 1)®,(x) 
where 
D(x) = xP + xP ee eT, 


It follows that ¢ is a root of ®,, which is irreducible over Q as we shall see. 
The substitution x = y + 1 yields 
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(y+ 1)? -1 
(v+1)-1 


= apa! Piep-2 4... iP P 
yrs (ite + (2 }y+(,7 5} 


to which the Eisenstein criterion applies using the prime p. Since ®, is irre- 
ducible, it must be the minimal polynomial for € over Q. Consequently, 


[Q(0): Q] =p — 1 and {1,f, 67, ..., €?-7} is a basis for Q(¢) over Q. 


110a. Let E be an extension field of F, and let « € E bean element algebraic 
over F. Show that F(«) is isomorphic to the field F [x]/(/) where fis a minimal 
polynomial of « over F. (See 1038.) 


Oy + 1) = 


110B. Let a, B € E be elements algebraic over the subfield F. It is clear that B 
is algebraic over F(a). We denote by F(a, f) the subfield of E obtained by ad- 
joining B to F(a). Show that F(a, 8) = F(f, «). What can be said of the 
degree [F(a, 8): F]? 


1107. Let E be anextension field of F which contains all the roots, a,,02,..., 
a, , of a polynomial f of degree n. The splitting field of fin E is the smallest sub- 
field of E containing F and the roots «,,a2,...,a,. We denote this by 
F(@,, @, .+.5,)- Prove that 


[a a4 ope oF | al. 


1105. Let E and E’ be two extensions of F in which a polynomial f over F 
splits. Prove that there exists an isomorphism @ from the splitting field of fin 
E to the splitting field of fin E’, such that ¢c = c for every ce F. 


110e. Let ae E be an element transcendental (that is, not algebraic) over a 
subfield F. Prove that F(«) is a field isomorphic to F(x), the field of rational 
functions of x over F (98a). 


110¢. Prove that two finite fields with the same number of elements are 
isomorphic. 


110n. Let ae E be anelement transcendental over the subfield F. What is the 
degree of F(a) over F(a*/4a> — 1)? 
111. Proposition. A finite extension is an algebraic extension. 


Proof. Let E be a finite extension of the field F, and suppose that 
[E: F] =n. Let a € E be any element. The set of n + 1 elements 


1h % O54) 
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must be linearly dependent. Therefore, there are elements cy, c,,...,¢, € F, 
not all zero, such that 
to $C,a+ °° +06,0" =0. 
Thus, « is a root of a polynomial over F and is algebraic. 


Corollary. If « is algebraic over F, then F(a) is an algebraic extension of F. 


Corollary. If F=FycF,c:::cF, is a tower of fields, all contained 
within an extension E of F, such that for i>0, F; = F;-,(a@;) where a; is 
algebraic over F;_,, then F,, is algebraic over F. 


Proof. Each F; is a finite extension of F;_,. By 97 we have that F, is a 
finite extension of Fy = F, and hence algebraic. 


112. Proposition. If E is an algebraic extension of F and D is an algebraic 
extension of E, then D is an algebraic extension of F. 


Proof. Let B be an element of D, and suppose f is a root of the poly- 
nomial g with coefficients %, a,, ..., %,- Consider the tower of fields 


Fo F(a) © Fao; %j) SS Feo ey, ---, @), 
where 


F(%, a1, Ds) at;) = F(a, a1, as ot; 1 )(0;). 


Since the last field F(a, a,,...,@,) contains all the coefficients of g, B is 
algebraic over it, and we may add to the tower 


Fay 90;, 82, a,; B) = F@5 Pee eee 


By the second corollary of 111, the field F(a, a,,...,a,, 8) and, consequently, 
the element f are algebraic over F. 


(If a, %,,..., &, are elements of a field E algebraic over a subfield F, we 
shall denote by F(a, a,..., @,) the smallest subfield of E containing F and 
Wanna Cre) 


112. Prove that the set of all algebraic numbers is a subfield of C. Show that 
the set of all algebraic numbers is countable. (Since C is not countable, this 
proves existence of transcendental numbers.) 


112B. Prove that if « and B are transcendental numbers, then either « + £ or 
aB is transcendental. 
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112y. Let ¢ be a polynomial over Q. Show that dx = z implies that a is 
transcendental. 


113. Proposition. Every irreducible polynomial over a number field has 
distinct roots. 


Proof. Let f be an irreducible polynomial over a number field F given by 
fal tt CX ett Qe 
Suppose f has a multiple root «. Then over C we can write 


fe = CX = o)""(x = a1) ee (x = Og tas 


where «,, %2,..., &,— are the remaining roots of f (distinct or not). Taking 
derivatives on both sides shows that f’ is a polynomial over F with « as aroot. 
(Indeed, we have f’x =c, +2c,x+°::+2c,x""' is divisible in C by 
(x —a)"~!.) This shows that f is not a minimal polynomial of a By 109 a 
minimal polynomial of « divides f, contradicting irreducibility. Thus, f must 
have distinct roots. 


113a. A polynomial f over a field F is separable if f and its formal derivative 
(100) have the constant polynomial | as a greatest common divisor. Prove 
the following statement: if fis a polynomial over F which splits in the extension 
field E, then f is separable if and only if the roots of f in E are distinct. (This 
explains the terminology.) 


Remark. We have departed from the accepted definition of a separable 
polynomial as one with distinct roots in a splitting field. To make sense, 
such a definition requires existence and uniqueness of splitting fields, which 
we have not developed in the text. (103B takes care of existence. See 1105 for 
uniqueness.) The above definition emphasizes separability as a property of the 
polynomial and suggests immediately a test for separability of a given poly- 
nomial (103«). 


113B. Show that an irreducible polynomial is separable if and only if its 
formal derivative is nonzero and that consequently an irreducible polynomial 
over a field of characteristic 0 is always separable. (How is this result related 
to the proposition of 1137) 


113y. A field F of nonzero characteristic p is perfectif every clement a € F has 
a p-th root in F, or equivalently, if x? — « has a root in F. (By convention a 
field of characteristic 0 is always perfect.) Prove that a field with a finite 
number of elements is perfect. 
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1135. Prove that over a perfect field of nonzero characteristic, an irreducible 
polynomial is separable. 


113e. An element « in an extension E of a field F is separable over F if it is a 
root of a polynomial separable over F. (It follows then that the minimal poly- 
nomial of « is separable. Why?) A separable extension is one in which every 
element is separable. Prove that every algebraic extension of a perfect field is 
separable. 


113¢. Prove that an imperfect field has an inseparable algebraic extension. 


Together 113 and 113 show that a field is perfect if and only if each of its 
algebraic extensions is separable. In fact many authors take this as the 
defining property of perfect fields. 


113n. Prove that every algebraic extension of a perfect field is perfect. 


1130.. Show that an element « of an extension E of a field F with nonzero 
characteristic p is separable over F if and only if F(a?) = F(a). 


1131. Let F bea field of nonzero characteristic p, and let E be a finite algeb- 
raic extension of F. We denote by E“”’ the smallest subfield of E containing F 
and the p-th power a? of every element « € E. (How do we know such a field 
exists?) Prove that E is a separable extension of F if and only if E® = E. 


113k. Fora e £, an extension field of F, prove that F(a) is a separable exten- 
sion of F if and only if @ is separable over F. 


113). Prove that if E is a separable extension of F, and if D is a separable 
extension of E, then D is a separable extension of F. 


113p. Let E be an extension field of F. Prove that the set K of all elements of 
E which are separable over F is a field. (K is called the separable closure of 
Fin E.) 


114. Theorem. Let F be anumber field, and suppose that a, B € C are alge- 
braic over F. Then there exists a number y € C algebraic over F, such that 


F(a, B) = F(). 


Proof. Let f be a minimal polynomial for a over F and g a minimal poly- 
nomial for B over F. Let « = a, anda,,a3,..., a, be the roots of f, which by 
113 are distinct, and let 6B = B, and B,, B3;,..., B,, be the roots of g—also 
distinct. Choose a nonzero element c € F distinct from all the numbers 

(a, — ;) 

(6B: — B) 
for i= 2, 3,...,n and j=2, 3,...,m. We set y=a+ cf. Then ye F(a, B) 
and is therefore algebraic over F. Consequently, we have F(y) < F(@, f). 
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Furthermore, y = «; + cB; only for i = j = 1. Now consider the polynomial 
h over F(y) given by hx = f(y — cx). The element B = B, is a root of h since 


hB = f(y — cB) = fa = 0, 


but none of the other elements, B,, £3, ..., By, is a root of h. Let g bea 
minimal polynomial for 8 over F(y). By 109 we must have g|h and g | g. This 
implies that every root of g is a root of h and of g. However, g and h have in 
common only the root f. Hence, g = a(x — 8) where ae F(y). Consequently, 
B € F(y), and therefore a = y — cB € F(y). Thus, F(a, B) < F(y), which implies 


F(a, B) = F(y). 
An extension of the form F(y) of a field F is called a simple extension of F. 


Corollary. If F, < F, < ++: < F,isa tower of number fields, each of which is a 
simple extension of its predecessor, then F,, is a simple extension of Fy. 


Proof. By 111, F, is algebraic over Fy. Let «,, «2,...,%, be elements such 
that. F;—=-F,.;(@;). Each «; is algebraic over Fy.and F,, = Fo(a,, @2,.-..5 0,). 
Repeated application of the theorem yields F, = F(y) for some ye C. 


114. Prove that every finite separable extension of a field F is a simple 
extension, that is, has the form F(y). (For a field F with an infinite number of 
elements, modify the proof above. When F is finite, use 100.) 

If E = F(y), then y is called a primitive element of E because y generates E 
over F. The statement of 114@ is generally known as the primitive element 
theorem. 


114B. Find a primitive element over Q for each of the fields Q(./2, WG), 
Q(i, ,/2), and Q(./3, 2e?*/), 


Constructions with 
Straightedge and Compass 


115. We shall determine which figures of plane geometry can be constructed 
with straightedge and compass alone. In doing so it is convenient to identify 
points of the plane with complex numbers: the coordinate point (a, b) cor- 
responds to the complex number a + bi. The plane figures constructible with 
straightedge and compass are determined by the following criteria: 
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(0) The points (0,0) and (1,0) are constructible. (Any two points of the 
plane may be chosen for (0, 0) and (1, 0) and the distance between them 
taken as the unit length.) 

(1) The line (or line segment) determined by two constructible points is 
constructible. 

(2) A circle with a constructible point as center and a constructible length as 
radius is constructible. (A constructible length is the distance between 
two constructible points.) 

(3) The intersection of two constructible lines is a constructible point. 

(4) The points (or point) of intersection of a constructible line and a con- 
structible circle are constructible. 

(5) The points (or point) of intersection of two constructible circles are 
constructible. 


Remarks. We shall call (0)-(5) the axioms of constructibility. Once they 
have been stated, the problem of constructibility with straightedge and com- 
pass is removed from the domain of mechanical drawing to the domain of 
mathematics. Axiom | indicates the only way the straightedge may be used: 
to draw the line between two previously constructed points. Axiom 2 indicates 
how the compass is used: the feet may be placed on two constructed points to 
determine a radius and then the compass transported to a third constructed 
point as center and the circle drawn. Axioms 3, 4, and 5 indicate the ways in 
which new points are constructed. A warning to the reader may prevent mis- 
interpretation: lines and circles are not to be considered as ‘“‘made up” of 
points; that a line or circle is constructible dées not imply that all points on the 
line or circle are constructible. Furthermore, we do not allow the choice of 
arbitrary points on or off lines or circles. 


116. Proposition. The line parallel to a given constructible line and passing 


Figure 6 


through a given constructible point (not on the given line) is constructible. 
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Proof. Let A beaconstructible point and /a constructible line not passing 
through A. Let B and C be constructible points which determine /. The circle 
with center A and radius AC is constructible. The circle with center C and 
radius AC is also constructible and so are its intersections with the line /. 
Let D be one of these .intersections. The circle with center D and radius 
CD = AC is constructible and intersects the circle with center A and radius 
AC in the points C and E. Thus, the point E is constructible. Finally, the 
line k determined by A and E is constructible and is parallel to /. 


117. Proposition. The perpendicular bisector of a constructible line segment 


C 


Figure 7 
is a constructible line. 
Proof. Let A and B be constructible points. The circles centered at A and 
B with radius AB are constructible and so are their intersection points C and 


D. The line determined by C and D is constructible and is the perpendicular 
bisector of AB. 


118. Proposition. The circle determined by three constructible points (not 


Figure 8 


lying in a line) is constructible. 
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Proof. Let A, B, and C be three constructible points which do not lie ona 
line. By 117, the perpendicular bisectors k and / of the line segments AB and 
AC are constructible. Consequently, their point of intersection is a con- 
structible point O. The circle with center O and radius AO = BO = CO is 
constructible and passes through 4A, B, and C. 


119. We shall call a complex number a+ bi constructible, if the corre- 
sponding coordinate point (a, b) is constructible with straightedge and compass 
according to the axioms of 115. The complex numbers 0 and | are construct- 
ible by axiom 0. 


Theorem. The constructible numbers form a field @. 


Proof. Since the numbers 0 and | are constructible, the real axis (which 
they determine) is a constructible line. Clearly, the number —1 is construct- 
ible. The perpendicular bisector of the segment between —1 and | is con- 
structible, so that the axis of imaginary numbers is a constructible line. 

First, we show that the real numbers which are constructible form a field. 
It is obvious that if @ and b are constructible real numbers, then a + b and 
—a are constructible. Suppose that a and b are constructible, positive real 
numbers. Then the numbers ai = (0, a) and — bi = (0, —d) are constructible. 
(Why ?) By 118 the circle through (— 1, 0), (0, a), and (0, — 5) is constructible. 


D = (ab, 0) 


Figure 9 


This circle intersects the real axis in a constructible point D. AB and CD are 
chords of the circle intersecting at the origin O. By a theorem of elementary 
geometry (AO)(OB) = (CO)(OD). It follows that OD = ab and D = (ab, 0). 
Thus, ab is constructible. In a similar manner, 1/a is constructible as shown in 
Figure 10. 
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Figure 10 


This completes showing that the constructible real numbers form a field. 


It is clear that the complex number a + bi is constructible if and only if the 
real numbers a and 6 are constructible. (We need to use 116 here.) It follows 


yey mate Sm (8) (a,b) 
| 
| 
| 
| 
| 

0 42,0) 
| 

Figure 11 


immediately from this observation that when a + bi and c + di are construct- 
ible complex numbers, the numbers 


(a+ bi)+(c+di)=(at+c)+(b+d)i, 
—(a+ bi) = (—a) + (—-))i 
(a + bi)(c + di) = (ac — bd) + (ad + be)i, 


and 


a (ace) = (ape) 
a+bi \a?+b?) — a+b?) 


are also constructible. 
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120. A constructible number field is a number field all of whose elements are 
constructible numbers. In others words, a constructible number field is 
nothing more nor less than a subfield of @, the field of all constructible 
numbers. Since @ is a number field, @ contains Q as a subfield; therefore Q is 
constructible. Furthermore, if «,, #.,..., %, are constructible numbers, then 
Q(a,,%2,---,%,) © @ and is a constructible number field. 


Theorem. If a constructible number field E is a finite extension of Q, then 
[E: Q] = 2’, where r is a nonnegative integer. 


Proof. First, we shall show that it is sufficient to prove the statement for 
constructible fields of real numbers. Since E£ is a finite extension of Q, we have 
E = Q(@) for some 6 € C by 114. Of course the number @ is constructible since 
6e€E, and if @ =a + bi, then a and b are constructible real numbers. Conse- 
quently, the field F = Q(a, b) is constructible. Furthermore, F is a finite 
extension of Q since F(i)=Q(0, 6, i) is a finite extension of Q. (@ =a — bi is 
algebraic over Q because @ is.) If [F: Q] = 2°, we have 


[F(i): EWE: Q) = [F@: Q] = [FQ@): F][F: Q] = 2°*? 


from which it follows that [E: Q] = 2” where r<s+ 1. 

Beginning with the field Q, we can construct any point with coordinates in 
Q, and consequently, we can construct any line ax + by + c = 0 or any circle 
(x — p)> +(y—q)* =r’, where a, b, c, p, gq, r€ Q. (We shall call these the 
points, lines, and circles of Q.) The only way we can obtain points whose 
coordinates do not lie in Q is by the intersections of lines and circles of Q. 
Two lines of Q will intersect in a point of Q if they are not parallel. However, 
the points of intersection of a line of Q and a circle of Q, or the points of 
intersection of two circles of Q will, in general, have coordinates in a quad- 
raticextension Q(./a) where a € Q, and « > 0. (The reader should verify this 
for himself.) 

Clearly, the same reasoning applies to any field F of real numbers: the 
points of intersection of lines and circles of F have coordinates in some quadratic 
extension F(/ a) where a € F anda > 0. (Note that a depends upon the parti- 
cular lines and circles being intersected and will vary from one instance to 
another. ) 

Now it is clear that beginning with Q and using only the methods of con- 
struction prescribed by the axioms of 115, we can reach only points which have 
coordinates in some tower 


O = Foe Fpes4es, 


where F; = Fys(/a) for a; € F;_, and «, > 0. Consequently [F;: Q] = 2/. 
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Suppose now that F is a constructible field of real numbers and a finite 
extension of Q. By 114, F = Q(é) where € is a constructible real number. Now 
&€ must lie in a tower 


Q=AychccF, 

of the type described above. Since € € F,, F = Q(¢) c F,. Now 
ee Er Oh [FQ] = 2", 

from which it follows that [F: Q] = 2° for some s. 


120e. Let F be a constructible number field and £ an extension field of 
degree 2 over F. Prove that EF is constructible. (This will be used in 135.) 


121. Trisection of Angles. Asa first application of the preceding theory, we 
shall demonstrate the impossibility of a general construction which trisects 
an arbitrary angle using only a straightedge and compass as prescribed in 
the axioms of constructibility (115). 

First let us show how angles may be trisected easily if we allow an incorrect 
usage of the straightedge. (Apparently this practical construction was known 
to ancient geometers.) 


Figure 12 


We assume that an angle « is given. With the compass set at a given length r, 
a circle is described with the vertex of a as center, so that the sides cut the 
circle at points A and B. Now this same given length r is marked on the 
straightedge. The straightedge is positioned so that the points marking the 
distance r fall upon the line OA and the circle (at C and D) and so that the 
line they determine passes through B. Since OBC and OCD are isosceles 
triangles, we have 


a= LAOB= LODB+ LOBD 

LODC + LOCB 
LODC + LODC * LCOD 
=3L0DC. 
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To demonstrate the general impossibility of trisecting angles with straight- 
edge and compass, it is sufficient to exhibit one angle for which trisection is 
impossible. We choose 60°. 

First, we observe that an angle « is constructible if and only if the real 
number cos « is constructible. Thus, we have only to show that cos 20° is not 


| COS a 


Figure 13 


constructible. For any angle 0, we have cos 30 = 4 cos* @ — 3 cos @. Since 
cos 60° = 1/2, it happens that cos 20° is a solution of the equation 4x? — 3x = 
1/2, or what is the same thing, a root of the polynomial 8x? — 6x — 1. 

The polynomial 8x* — 6x —1 is irreducible over Q: the substitution 
x = Wy + 1) yields y? — 3y? — 3, which is clearly irreducible by the Eisenstein 
criterion (107); were 8x° — 6x + 1 reducible over Q, the same substitution 
applied to its factors would yield a factorization of y* — 3y? — 3. 

Now it follows that 8x° — 6x — 1 is a minimal polynomial for cos 20° over 
Q and that [Q(cos 20°): Q] = 3. By 120 we are forced to conclude that 
Q(cos 20°) is not a constructible number field and that cos 20° is not a con- 
structible number. Thus, the angle 20° cannot be constructed. 

There are four famous problems of antiquity concerned with straightedge 
and compass constructions. One is the trisectability of angles, which we have 
just disposed of. Another is the constructibility of regular polygons, which we 
shall take up in articles 135-138. A third is the squaring of the circle, that is, 
the problem of constructing a square with area equal to that of a given circle. 
Algebraically this is equivalent to the constructibility of the number Jn, 
which is clearly impossible once it has been proved that z is transcendental. 

The fourth problem is the duplication of the cube. The legend is that the 
citizens of Delos inquired of the oracle at Delphi what could be done to end 
the terrible plague decimating their city and received the answer, “ Double 
the size of the altar of Apollo.’”’ They replaced the cubical altar with anew one 
whose sides were twice the length of the sides of the original, but still the 
plague reigned. A second consultation of the oracle revealed that the require- 
ment was to double the volume of the original altar. Of course this is 
equivalent to the problem of constructing 2/2, which is not possible with 
straightedge and compass because the minimal polynomial for 2/3 over the 
rational field Q is x? — 2, and [Q(3/2): Q] = 3. 


Galois Theory 


Chapter 4 


The Galois theory of equations is one of the most beautiful parts of mathe- 
matics and one of the roots of modern algebra. The basic idea of Galois 
theory is that for a given field, every extension field of a certain kind has 
associated with it a group, whose structure reveals information about the 
extension. In particular, splitting fields of polynomials have this property, and 
solvability of the associated group determines solvability of the polynomial in 
radicals. Consequently, to prove that equations of the fifth degree are not 
always solvable in radicals (over the rational field Q), we have only to find a 
polynomial equation whose splitting field is associated with the symmetric 
group S,, which we know is not a solvable group. 

This elegant theory is the work of the tormented genius, Evariste Galois 
(1811-1832), whose brief life is the most tragic episode in the history of 
mathematics. Persecuted by stupid teachers, twice refused admission to the 
Ecole Polytechnique, his manuscripts rejected, or even worse, lost by the 
learned societies, Galois in bitterness immersed himself in the radical politics 
of the revolution of 1830 and was imprisoned. Upon his release he got involved 
in a duel and was fatally wounded, dying before his twenty-first birthday. 
His manuscripts, hastily scribbled in prison and on the eve of his duel, did not 
receive the attention they deserved until they were read by Liouville in 1846. 
Only in 1962 was the critical edition of all Galois’s writings finally published, 
but his reputation as a genius of incredible power has been secure for over a 
hundred years. 
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Automorphisms 


122. An automorphism of a field E is a one-to-one onto mapping, ¢: E> E, 
which preserves addition and multiplication, that is, 


$+ B)=da+ Bh and (af) = (baNPA). 


In other words, ¢ is an automorphism of the additive group structure and ¢*, 
the restriction of ¢@ to E*, is an automorphism of the multiplicative group 
structure. 

If @ and w are automorphisms of the field E, then so is their composition 
ow. The inverse of an automorphism is again an automorphism. It is easy to 
see that the set of automorphisms of a field E is a group, which we denote 
GE). 

Of course the identity mapping 1, is an automorphism of the field £, and it 
is the identity element of the group H&E). 

Whenever we speak of a group of automorphisms of a field E, we shall 
understand that the group product is composition of automorphisms. In other 
words, the term “group of automorphisms of £” is synonymous with 
“subgroup of &E).” 


122«. Show that the groups of automorphisms 4(Q) and &(Z,) are trivial 
groups. 


122B8. Determine the group of automorphisms of a field with four elements. 
122y. Determine the group of automorphisms of Q(i) and Q(./2). 


1225. Prove that the group of automorphisms of Q(¢) where C = etl? 
p prime, is isomorphic to Z,. 


122. Let ¢ be an automorphism of a field E. Prove that the set 
F= {ae EF|¢da =a} 
is a subfield of E. 


122¢. Let E be a finite field of characteristic p. Show that the mapping 
o: E> E given by ¢a = @ is an automorphism. Under what conditions is @ 
an automorphism when E is infinite? 


122n. Let E = F(a) and suppose that B is a root in E of a minimal poly- 
nomial of « over F. Show that there is a unique automorphism ¢: E > E, such. 
that da = B and ¢c = c for ce F. 


123. Let @ be an automorphism of the field E. We say that @ leaves fixed an 
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element a € E if 6a = «a. We say that ¢ leaves fixed a subset X¥ of E if da =a 
for all « € X. The set 


{ae E| da =a} 


clearly forms a subfield of E, which we call the fixed field of ¢. The fixed field 
of @ is the largest field left fixed by ¢. 
If ¢,, 62, ..., &, are automorphisms of E, then the set 


{ee El Ga = b2.4 =": = $4 =a} 
is called the fixed field of ¢,, 2, ...; Dy- 
As an example, let ¢: Q(,/2) > Q(./2) be the automorphism given by 


d(a + b,/2) =a — b,/2, 


where a, b € Q. Then the fixed field of @ is just Q. 

For any subset X ofa field E the automorphisms of E which leave fixed the 
set X form a group which we denote GE, X). 
1230. Let ¢ = e?"/5, and let d denote the automorphism of Q(¢) given by 
ot =C*. Prove that the fixed field of ¢ is Q(/5). 
123B. Let @ be an automorphism of a field E leaving fixed the subfield F. 
Show that « e€ E and a root of fe F[x] implies da is also a root of f. 


123y. Let ¢ be an automorphism of a field E with fixed field F. Show that ¢ 
extends uniquely to a mapping 


¢: Ex] > Ele] 


with the following properties: 


(1) ¢c=¢c for any constant polynomial c, 
(2) ox =x, 
(3) $(f+9) =f) + 9), 
(4) $(f9) = (O/)(¢9). 
Furthermore, show that f= / if and only if fe F[x]. 


1235. Let ¢ bean automorphism ofa field E with fixed field F. Suppose that 
fe E[x] is monic and splits in £. Prove that if fe = 0 always implies f(¢«) = 0 
for a € E, then fe F [x]. 


123. Let E be the splitting field in C of the polynomial x* + 1. Find auto- 


morphisms of E which have fixed fields Q/ =), Q,/2), and Q(i). Is there 
an automorphism of & whose fixed field is Q? 
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124. A finite set of automorphisms of a field FE, {¢,, 6,,..., >, }, is called 
linearly dependent over E if there are elements of E, c,, C2, ..., C, € E, not all 
zero, such that 


€1(P1%) aR C2(P2 ct) omy wg CHPn a) =0 


for all elements «. Otherwise {f,, 62,..-, &,} is called linearly independent. 
(Although these definitions are made by analogy with the situation for vector 
spaces, this analogy is formal and the automorphisms @¢,, ¢2,..., @, should 
not be viewed as elements of a vector space.) 


Proposition. If ¢,, 6,,..., $, are distinct automorphisms of E, then the set 
{b,, 2, ---; Py} is linearly independent over E. 


Proof. Suppose {¢,, 62,..., @,} is linearly dependent over E. Among all 
the relations of linear dependence involving the @,’s, there is a shortest one 
(that is, one with the fewest nonzero coefficients). Renumbering if necessary, 
we may assume that such a shortest relation has the form 


C1(b1 4) + Co(G2%) + °° + ¢,(G, 0) = 0, (1) 


for all ae E, where r <n, and c,,c,,..., c, are nonzero (the zero coefficients 
have been deleted). Choose an element f € E such that ¢,6 # ¢, 8. (The @,’s 
are distinct by hypothesis.) Now we have 


€1($1B)(G1%) + C2(2 BY G2 %) + °-+ + c(G, BY, x) = 0, (2) 
c1(, B)(b1%) + c2($, B)(2 %) + °°: + c($,B)(O, 4) = 0, (3) 
for all wa € E. Here (2) is obtained by substituting Ba for « in (1) and observing 


that $,(Ba) = $,(B)¢,(«). Equation (3) is the result of multiplying (1) by @, B. 
Subtracting (3) from (2) gives the new and shorter relation 


C1(G,%) + €3(G2%) + °° + -1(G,-1%) = 0 (4) 
for all a where c; = c,(o; 8 — $, 8). Then c, = c,(¢,B — $, B) # 9, and relation 


(4) is nontrivial. This is a contradiction, since (1) is the shortest relation, and 
it establishes the linear independence of {¢,, 2, ..., On}. 


125. Proposition. If b,,62,..., , are distinct automorphisms of E, each of 
which leaves fixed the subfield F of E, then {[E: F]>n. 


Proof. Suppose [E: F] =r <n and that {@,, w,,..., w,} 1s a basis for E 
over F. Consider the system of equations with coefficients in E, 
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(p11 )X, + (Pr @)x2 +°+* + (b, @1)x, = 0, 
(p1@2)x, + (2 W2)x2 + -*: +(b, 2)x, = 0, 

(*) 
($1@,) x, ate (¢2 W,)X2 Stet ote (¢, W,)Xn = 0. 


Since this system has fewer equations than unknowns, there is a nontrivial 
solution x; =Cc,, X, =C2, ..., X, =C,.- (That is, not all the c;’s are zero.) 


Now we show that this implies that {¢,, 62, ..., @,} is linearly dependent— 
in fact we show that 


C1(P1%) + Co(P2 @) + ++ + C,(b, 0) = 0 


for all we E. Since {w,, @,,..., @,} is a basis for E over F, we can write 
any «e€ E as 


4= 4,0, + a,@,+°°'+4,0, 


for unique a,, a,,..., a,€ F. Now we have ¢,(a;) = a,;, and therefore, 


Leta) = ¥(6(¥.4,0,)] 


ss Ss C;aj(P;@;) 


i=1 j=1 


La (dew, ) ») 


Since c,, C2, ..., C, give a solution of the system (+), it follows that 


Y ¢f{¢;0) =0 for j=1,2,...,$r. 


i=1 
Thus, )7_, ¢(o;«) = 0, and {4,, $2,..., ¢,} are dependent. Since our hypo- 
thesis includes that ¢,,6,,..., ¢, are distinct, this contradicts 124. Con- 


sequently, our assumption that [E: F] <n is incorrect, and _ therefore, 
[E: F] >n. 


N.B. The hypothesis is only that F is left fixed by each ¢;, not that Fis the 
fixed field of $,, d,,..., d,- 


126. Proposition. If F is the fixed field of a finite group G of automorphisms 
of E, then [E: F] = o(G). 


108 4 Galois Theory 


Proof. Let G= {d,,2,-.-.-.,%,} and suppose [E:F]=r>n. Let 
{w,,@2,..., @,} be a basis for E over F. Consider the system of equations 
over E, 


(P,@1) x1 + (fi @2)x2 +++ + (G1 @,)x, = 0, 
(Pz @)X, + (G2 @2)x2 + +** + (G2 @,)x, = 0, 

5 (*) 
(Pn @1) x1 + (P,@2)X2 + ++ + (bn @,) x, = 0. 


Since this system has r — ” more unknowns than equations, there is a non- 
trivial solution x; = c,, x2 =C2,..., X, = c, in which r — n of the c,’s may be 
chosen arbitrarily. For i= 1, 2,..., r let 


a; = GC; + O20; + °°" + bnG;. 


We may choose ¢,, C2,..., C,-, SO that a,,a,,..., a,_, are nonzero. (Why ?) 
For i= 1, 2,..., r the elements a; are left fixed by each element of G, and 
consequently, a,, a,, ..., a, € F, the fixed field. Now we have 


Yawi= Y (Fd c)oi= ¥ 4, ¥ ed7'a)) =0 

i=1 i=1\j=1 j=1 i=1 
since t=, c(¢;'@,;)=0. This contradicts the linear independence of 
{@,,@,-..,@,}. It must be that [E: F] < o(G). On the other hand, we 
know from 125 that [E: F] = 0(G) =n. Thus [E: F] = o(G). 


126%. Let E denote the splitting field in C of x* +1 over Q. Prove that 
[E: Q] = 4. 

1268. Find a group of automorphisms of Q(¢), where ¢ = e?*"/°, of which 
the fixed field is Q, and determine [Q(¢): Q]. Howelse can [Q(¢): Q] befound? 


126y. Let E denote the splitting field in C of x* — 2 over Q. Find a group of 
six automorphisms of E with fixed field Q, thereby showing that [E: Q] = 6. 


Galois Extensions 


127. A field E is a Galois extension of F if F is the fixed field of a finite group 
of automorphisms of E, which we call the Galois group of E over F and 
denote 4(E/F). With this definition we may restate succinctly the proposition 
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of 126: the degree of a Galois extension is the order of its Galois group. In other 
words, when E is a Galois extension of F, we have [E: F] = o(@(E/F)). It 
follows that the Galois group Y(E/F) contains every automorphism of E 
which leaves F fixed: were there one it did not contain, then by 125 we would 
have [E: F] > o(@(E/F)), contradicting 126. 


Remark. Some authors use “normal extension” in place of ‘Galois 
extension.”’ This is unfortunate, since “‘normal extension’ has another more 
generally accepted use. (See 129«.) 


127. Show that an extension of degree 2 is Galois except possibly when the 
characteristic is 2. Can a field of characteristic 2 have a Galois extension of 
degree 2? 


127B. Show that Q(¢), where { = e?*'/°, is a Galois extension of Q. 


127y. Show that Q(3/2) is not a Galois extension of Q. Find a Galois exten- 
sion of Q which contains Q(3/2) as a subfield. 


1275. Suppose that E = F(a) is a Galois extension of F. Show that 
fx = (x — Ga) — G2 4)°* (x — $0) 


is a minimal polynomial for « over F, where ¢,, $2,..., @, are the elements of 
G(E/F). 


127e. Let E be a Galois extension of F and suppose that a € E is an element 
left fixed only by the identity automorphism of E. Prove that E = F(a). 


128. Proposition. Let $,, 2, ..., $, be distinct automorphisms of a field 
E, each leaving fixed the subfield F. If [E: F] =n, then Eis a Galois extension 
of F with group 


GEL) = {h,, do By ce Re} Pn}: 


Proof. Since {¢,,¢2,.-.-, >,} © #E), to show that this set is a group 
under composition we need only verify that it is a subgroup of 9 (E). Suppose 


that a composition ¢;¢;¢ {¢, 2, -.-, Ga}. Then ¢,¢; in addition to 
1, P2,---> Pn» leaves F fixed, which by 125 implies [E: F] > + 1, a con- 
tradiction. A similar contradiction arises if ¢;' ¢ {¢,, 2, .... ¢,}. Con- 


sequently, {¢,, $2,..., ,} isa subgroup of G(E), and hence a group. The 
fixed field F’ of {f,, $2, ..., ¢,} contains F and satisfies [E: F’] =n by 126. 
The equation 


[EoF) =(E2F EDF) 


yields [F’: F] = 1, which means F = F’. This proves the proposition. 
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128%. Let E be an extension of Z, such that [E: Z,] =n. Since E* is a 
cyclic group (100) and has a generator 0, we know that E = Z,(6). Let 
~: EE be given by $x = x. Prove that 1,¢,67,..., 6”! are distinct auto- 
morphisms of E leaving Z, fixed and conclude that E is a Galois extension of 
Z, with cyclic Galois group 


G(E|Z,) = (1, 6, 67, ..., 674}. 


129. Theorem. E is a Galois extension of F if and only if the following con- 
ditions hold: 


(1) an irreducible polynomial over F of degree m with at least one root in E 
has m distinct roots in E; 

(2) Eis a simple algebraic extension of F, that is, E = F(0) for some element 
6 €E which is algebraic over F. 


Proof. Necessity of condition (1). Suppose E is a Galois extension of F 
with group @E/F) = {¢,, 62, .-., d,}. Let f be a polynomial irreducible over 


F with a root aE E. We let a,,a,,..., a, denote the distinct values among 
the elements ¢,0, d,0,...,0,%¢€£. Then any automorphism in &(E/F) 
simply permutes the elements «,, «, ..., &,. It follows that each automor- 


phism of G(E/F) leaves fixed: all the coefficients of the polynomial 


gx = (x — %)(x — 0) ° ( — a). 


Thus, all the coefficients of g lie in the fixed field F, that is, g is a polynomial 
over F. Since a is among the elements a,, #7 ,..., &, we have ga = 0, and con- 
sequently, f| g. However, g splits in E, and therefore f must split in E. Clearly, 
the roots of f are all distinct. 

Necessity of condition (2). If F is a finite field, then so is E. (Why?) There- 
fore the multiplicative group E* is cyclic (100) and has a generator @. It 
follows that E.= F(@). This takes care of the case where F is finite. 

Suppose F is infinite. Let 6 € E be an element left fixed by as few auto- 
morphisms of the Galois group Y(E/F) as possible. (Why does such an 
element exist ?) We let 


Go = {be HE/F)| G0 = 9}. 


Clearly, Y, is a group. We claim that Gg consists of the identity automorphism 
alone, or in other words, that o(G,) = 1. Suppose o(G,) > 1. Then the fixed 
field B of Y, is a proper subfield of E since [E: B] = 0(G,). Let ne E— B. 
Then there is at least one automorphism in Y, which does not fix yn. Take 
M1, 2,-++,, to be the distinct values among ¢,y, $2, ..., $,4, and similarly, 
take 6,, 8,,..., 0, to be the distinct values among $9, $20,..., 6,0. We may 
assume 9 = 4, and @ = @,. Since F is infinite, we may choose c € F to differ 
from all the elements 
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(n; — 11) 
(0; — )) 


fori=2,3,...,r and j=2,3,...,5. LettC =n+c0=n, + cO,. The ‘choice 
of c insures that € =n; + cO; only when i=j = 1. Nowif € is left fixed by 
de GYE/F), we have 


C= $f = O(n + cB) = (bn) + ($0) = 9; + 0; =n + CO, 


which implies that gn = n and $0 = 9. Thus € is fixed only by the automor- 
phisms of Y(E/F) that fix both n and 6. Since there is at least one automor- 
phism in Y, which does not fix n, it is evident that € is left fixed by fewer 
elements of Y(E/F) than 9. This contradicts the choice of 8 and establishes 
that Y, has order |. It follows that the elements $,0, $2 90,..., 6,0 are dis- 
tinct: p;6 = $,6 implies $; 'd;6 = 9, from which we infer that $7 '¢; =1¢ 
and $; = $;. Furthermore, the elements ¢,0, 620, ..., 6,9 are all roots of a 
minimal polynomial for 6 over F. (Why?) Therefore we have [F(@): F] > xn. 
However, F(@) c E and [E: F] =n. Consequently, E = F(6@). 

Sufficiency of the conditions. Let E be an extension of F, which satisfies 
conditions (1) and (2). Then E = F(@) and the minimal polynomial of @ has n 


distinct roots 6, = 0,0,,...,0,, where n = [E: F'] is the degree of the minimal 
polynomial. Since for all i, F(0;) c E and [F(0;): F] =n, we have F(6;) = E. 
Now we construct n automorphisms of E, ¢,,¢2,...,0,, by setting 


(0) = 0;. Since the set 1,0, 07,...,0"~! forms a basis for E over F, each 
element of E may be written as g@ where g is a polynomial over F of degree 
less than n. Now we set $,(g0) = g0;. It follows (as the reader should verify) 
that the mappings ¢,, ¢2,.-.., ¢, are automorphisms of E. Clearly, they leave 
F fixed, and therefore E is a Galois extension of F with group Y(E/F) = 
{d1, 25 TRS S59) Pn}. 


129a. An extension E of a field F is normal if every irreducible polynomial 
over F with a root in E splits in E. Prove that an extension E of F is Galois if 
and only if it is finite, separable, and normal. (For the definition and proper- 
ties of separable extensions see exercises 113a@-113p. See also 114&.) Since all 
extensions in characteristic 0 are separable, we may conclude that a finite 
extension of a field of characteristic 0 is Galois if and only if it is normal. This 
explains the occasional use of the word ‘‘ normal’ for what we call ‘‘ Galois” 
extensions. 


129B. Give an example of an extension which is finite and separable but not 
normal. 


129y. Give an example of an extension which is separable and normal but 
not finite. 
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1295. Let E denote the field of rational functions Z,(x). (See 98a.) Let 
F = Z,(x?). Show that E£ is a finite normal extension of F, but not separable. 


130. The Fundamental Theorem of Galois Theory. Let E be a Galois exten- 
sion of the field F. If B is a field between E and F, then E is a Galois extension 
of B and GE/B) is a subgroup of GE/F). Furthermore, B is a Galois 
extension of F if and only if G(E/B) is a normal subgroup of G(E/F), in 
which case G(B/F) is isomorphic to the quotient group G(E/F)/G(E/B). 


Proof. First we show that E is a Galois extension of B. By 129, E = F(@) 
for some 0 € E. Clearly E = B(6) also. If fis irreducible over B witha rootae E, 
then /|g where g is a minimal polynomial for a over F. By 129, g has all its 
roots (which number deg g) in E, and they are distinct. Because /| g, the same 
is true of f. Thus, conditions (1) and (2) of 129 hold for E as an extension of B. 
Consequently, E is a Galois extension of B. Since Fc B c E, it is obvious that 
G(E/B) is a subset of G(E/F). Both are subgroups of G(E), hence Y(E/B) is a 
subgroup of G(E/F). 

Suppose B is a Galois extension of F. Then B = F(é) forsome €e€ B.Ifgisa 
minimal polynomial for € over F and deg g = m, then g has m distinct roots in 
B—all the roots it can have. If ¢ e G(E/F), then d(gt) = g(é) = 0, and pé 
is aroot of g, hence $é € B. It follows that @ maps B into B, since the element 
€ generates B over F. Thus for each automorphism ¢ € Y&(E/F), its restric- 
tion to B, denoted ¢| B, is an automorphism of B. Furthermore, since ¢ fixes 
F, $|B also fixes F, and therefore ¢| Be G(B/F). All this information 
can be summed up as follows: there is a group homomorphism h: G(E/F) 
— G(B/F) given by h(d) = ¢|B. The kernel of his the subset of Y(E/F) con- 
sisting of all automorphisms whose restriction to B is just 1,. In other words, 
Ker h = G(E/B). The kernel of a homomorphism is alwaysa normal subgroup 
(65). Consequently, Y(E/B) is a normal subgroup of GY(E/F). 

Suppose on the other hand that we know G(E/B) to be a normal subgroup 
of G(E/F). Then for ¢ € Y(E/B) and wy € Y(E/F) we have wow € Y(E/B), 
and for Be B we have wy" ‘owB = 8B, or dWB = WB. Fixing w and letting ¢ 
run through GE/B) shows that WB belongs to B, the fixed field of Y(E/B). 
To summarize, Be B and We G(E/F) imply wBe B. We may once again 
define a homomorphism h: G(E/F) > GB, F) with Ker h = Y(E/B). (Here 
G(B, F) denotes the group of automorphisms of B which leave F fixed; we 
do not yet know that F is the fixed field of G(B, F).) By 67 we know that h 
induces a monomorphism 


h': GE/F)/GE/B) > G(B, F). 
Now it follows that 


[B: F] = [E: FJ/[E: B] = o(@(E/F))/o(G(E/B)) = of AB, F)): 
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On the other hand, F is contained in the fixed field of (B, F) and therefore 
[B: F] > o(G(B, F)). Consequently, [B: F] = o(G(B, F)). Then 128 implies 
that B is a Galois extension of F with group 4(B/F) = 4B, F). Finally, we 
note that 


o(GW(E/F)/G(E/B )) = (A B/F)) 
implies that h’ is an isomorphism. 


130. Prove that if E is a Galois extension of F, then there are only a finite 
number of fields between E and F. 


130B. Let E = Q(¢) where ¢ = e?*/’. Show that E is a Galois extension of Q 
and determine the Galois group. Find all the fields between Q and E, the sub- 
group of Y(E/Q) to which they belong, and determine which are Galois exten- 
sions of Q. 


1307. Let E bea Galois extension of F and let B, and B, be two intermediate 
fields. (That is, Fo B, cE and Fc B,c E.) We say that B, and B, are 
conjugate if there is an automorphism ¢ € 4@(E/F) such that $B, = B,. Show 
that B, and B, are conjugate if and only if the groups Y(E/B,) and Y(E/B,) 
are conjugate subgroups of Y(E/F). 


1305. Let E be a Galois extension of F with Y(E/F) a cyclic group of order 
n. Prove that the following conditions hold: 


(1) For each divisor d of n there exists precisely one intermediate field B 
with [E: B] =d. 

(2) If B, and B, are two intermediate fields, then B, c B, if and only if 
[E: B,] divides [E: B,]. 


130. Prove the converse of 1306. In other words, show that if (1) and (2) 
hold for the intermediate fields of a Galois extension E of F, then G(E/F) 
is cyclic. 


130¢. Let E be a finite extension of F and let B, and B, be intermediate 
fields such that no proper subfield of E contains both B, and B, . Show that 
if B, is a Galois extension of F, then E is a Galois extension of B,, and that 
G(E/B,) is isomorphic to a subgroup of Y(B,/F). Show that B, 0 B, = F 
implies that G(E/B,) ~ G(B,/F). 


130n. With the same hypotheses as in 130 prove that if B, and B, are both 
Galois extensions of F, then E is a Galois extension of F. Show further that 
when B, 0 B, =F, GE/F) = GE/B,) x QE/B;). 


114. 4 Galois Theory 


1300. Let C(x) denote the field of rational functions over C, the field of 
complex numbers. Consider the six mappings ¢; : C(x) + C(x) given by 


— 1 
b1:f0) 2f, bat fo) (7), 


bif@)f0-9, — bs:f0)-4(—), 


l-—x 
XG 
bs:f) SUID, be: f@) ~s(), 


for any rational function f(x) € C(x). Verify that these mappings form a 
group of automorphisms of C(x), and determine the fixed field of this group. 
How many intermediate fields are there? 


1301. Prove that a finite extension of a finite field is Galois with a cyclic 
Galois group. 


131. Symmetric Polynomials. We digress briefly to prove a result on poly- 
nomials in several variables needed in the next article. For brevity we avoid 
studied rigor and appeal to intuition. Readers requiring a more thorough 
discussion will find one in van der Waerden’s Modern Algebra, Chapter IV, 
§26. 

Let F” denote the n-fold cartesian product F x F x -:: x F. (Recall that a 
point of F" is an n-tuple (c,, c,,..., c,) of elements of F.) 

A polynomial in n variables over the field F is an expression of the form 


S Ogg Nee 5%) = YC, V7... , Vel en 


where )" denotes a finite sum, the coefficients c(v,, v2,..., v,) are elements 
of F, and the exponents v,, v,,..., v, are nonnegative integers. Each term 
x}'x3? +++ x7" is called a monomial, and its degree is the sum v, + v. +°*' + V,- 
The degree of a polynomial is the highest degree among its monomials with 
nonzero coefficients. 


Given a polynomial fin n variables over F and a permutation of n letters, 
meéS,, we define a new polynomial /” in n variables over F by setting 


Fis X25 -+29%n) =F Xnagetntey ge ee 
For example, suppose 
FQ, %, X53) = 3 + x, Xe and = cee 
Then 


fo (ee X2; x3) = f(x, »X3> a) = x} 3 X34. 
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A polynomial fin 7 variables over F is symmetric if f* = ffor all ze S,. In 
other words, a symmetric polynomial is one which remains the same under all 
permutations of its variables. For example, 


gs 88g Sy) SO ee 


is symmetric. The most important symmetric polynomials, as we shall see, are 
the elementary symmetric functions o,, 02, ..., 0, defined by the equation 


(XH — xX — x2)-°-(X — x,) = X"-— 9, X""' +0, X" 7 —+ +> + (-1)"o,. 


It follows that 


GM ok ans Xp) =X + Xa +o +X 
PGi 50. e)— DX, 
: i<j : 
6,(X1, x2, eres Ky) = Types “Xp 
In general, o,(x,, x,,..., X,) is the sum of all the monomials x;,x;, °** %i,, 
where i, <i, <:+:<i,. If g is any polynomial in the elementary sym- 
Metiomumcuons, then g(o,,¢,,..., ¢,) =f (X1,.X%2,-<.,%,), where f is a 


symmetric polynomial. For example, 


of — 20, =xPtxZte +x. 


Theorem. A symmetric polynomial in n variables over the field F can be 
written uniquely as a polynomial in the elementary symmetric functions 
TyepOg «>. ¥6,-over F. 


Proof. The proof is by induction on the number of variables. The case 
n= 1 is trivial. Assume the statement is true for polynomials in ” — 1 (or 
fewer) variables. The induction step from n — | to n will be proved by induc- 
tion on the degree of the polynomial. The case of zero degree is trivial. 
Suppose that the statement is true for polynomials in 7 variables of degree 
less than m (as well as for all polynomials in fewer variables). Given a poly- 
nomial f of degree m which is symmetric in n variables, we let f’ denote the 
polynomial in n — | variables given by 


f'(%, x2, td ie) = f(x, x2, ms eae OD 


Then /’ is symmetric and may be written as a polynomial g(a", C4 515. FOR.4) 
in the elementary symmetric functions in n — 1 variables, o},05,..., 6; 
We note that 


ORR Key. -, Ngsy) OLR AL 5 «<5 Klay} DO): 
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Now consider the polynomial / in n variables given by 
WX, X25 +++ Xe = IX Mose) S Gia os wae bee 
Clearly, # is symmetric and furthermore 


Ao, Kee n-12 0) =0 


Consequently, x, divides h, and by symmetry so do x1, X2,..., X,-4. It follows 
that A = o,h, where h is symmetric and has degree less hans m. a, our induc- 
tion hypothesis, A may be written as a polynomial g(c,, 02, ..., ¢,). Finally, 
we have 


S(%1; X2, Pee = (4, G2, = +5 On—1) P Onley, Ca. ey 


We omit the uniqueness argument for brevity. 


13le. Express the following symmetric polynomials in terms of elementary 
symmetric functions: 


XT + XyX2 +43 + .x2%3 +23 +x5%, 
Xixs + xg xe EXE xe, 
(x1 — X2)7(x2 — x3)*(x3 — 1)’, 
x+x3+ x3, 
xt + x3 +x. 
131B. Let mx, x2,---> %,) =X, txet°':+2x%. Prove the following 
Newton identities (used in 147): 


(for k <n) nm, # (=1fko, = (— Ds (eee 
i=1 
(for k >n) m= (1? eee 
i=k—-n 


13ly. Supply the proof of uniqueness for the theorem of 131. 


1315. Let F(x,, x,,..., x,) denote the field of rational functions over F in 
the n variables x,, x.,..., x,- In other words, F(x;, x2,.-.., X,) is the field of 
all quotients p/q where p and q are polynomials over Fin x,, X2,..., X,- Prove 
that F(x,, x2,-.-, X,) is a Galois extension of F(¢,, 0,,...,6,) with group 
isomorphic to S,, the symmetric group on n letters. (As above, 0,, 0,,..., o, 
denote the elementary symmetric functions of x,, x2, ..., X,-) 


132. Let F be a number field and let f be a polynomial over F. The funda- 
mental theorem of algebra (101) implies that f splits over C, the field of 
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complex numbers. In general there will be many number fields in which f 
splits, but the smallest such field (containing F) is unique; it is called rhe 
splitting field for f over F. If a,, %,,..., a, are the roots of f, then it is clear that 
the splitting field for f over F is the smallest field containing F and a,, «,..., 
Gepethates, F(a 02, ...5 %) 


Theorem. If E is the splitting field of a polynomial f over the number field F, 
then E is a Galois extension of F. 


Proof. Ve shall verify the conditions of 129. As we remarked above, 
E = F(a, a,,...,,), where a,, @,...,@, are the roots of f. It follows that 
every element of E may be written (in several ways perhaps) as 


GOO, 2 +s Oa), 


where g is a polynomial in n variables over F. 
Suppose that / is an irreducible polynomial over F with a root B € E and 
degh=m. We write B = g(a,, &2,..., &,) as above, and for mE S,, we set 
B™ = G"(Oy, Hp, ..+5 Uy) = GIon1)» An(2)s +09 On(n))s 


Consider the polynomial p given by 


p(x) = [] («— B*). 


neéeSn 


All the coefficients of p(x) are symmetric polynomials in the roots a, @2,..., 
a, of f By 131 we have that each coefficient of p(x) may be written as a poly- 
nomial in the elementary symmetric functions o,(a,,4,,...,%,),=1,2,...,n. 
However, each o,(a,, a,,..., ,) € F, since 


fx = C(x — a, (x — a) °° (x — @,) = e(x" — x71 4+ +: (—1)"0,) 


is a polynomial over F. Thus p is a polynomial over F and pf = 0. Since h is a 
minimal polynomial for B, we have h| p, from which it follows that / has all 
its roots in E. This verifies condition (1) of 129. Condition (2) of 129 is a 
consequence of 114. Thus, the theorem is proved. 


Corollary. The Galois group, G(E/F), is a group of permutations of the roots 
Bi, Seo soy &, Of fF. 


Proof. Since each ¢ € G(E/F) leaves F fixed and / is a polynomial over F, 


(fai) = f(Pa;) = 9 


and da, is again a root of f. 
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132a. Indicate the modifications of the proof of the theorem above which are 
necessary to prove the following more general theorem. 

Let E be the splitting field (in some extension) of a separable polynomial f 
over the field F. Then E is a Galois extension of F. 

Show that if separability of fis dropped from the hypotheses, we may still 
conclude that E is anormal extension of F. (See 129a for definition of normal.) 


132B. Let E be the splitting field in C of a polynomial f over Q with no 
repeated roots. Show that Y(E/Q) acts transitively (86) on the roots of f if 
and only if f is irreducible. 


132y. Let E be the splitting field over Q of a polynomial of degree n. Prove 
that o(Y(E/Q)) divides n!. 


1325. Let E be the splitting field over Q of a polynomial of degree 8 which is 
reducible over Q but has no root in Q. Show that [E: Q] < 1,440. 


133. In this article we give an example of a Galois extension for which we 
can compute the Galois group explicitly. 

Let K denote the splitting field over Q of the polynomial x* — 2, which is 
clearly irreducible over Q by the Eisenstein criterion (107). The roots of x* —2 


are +4/2 and +i ny Os Clearly K = Q(é/2, i), and consequently each auto- 
morphism of GK, Q) is determined by its values on $f and i. All the pos- 
sibilities are given by Table 6. 


Table 6 


Automorphism — Value on 4D Value on i 


e 4/9 i 
Co) ey) i 
oa? —+/2 i 
co Te i 
@ ae =i 
oT 5/2) -i 
a7t = —i 
Gin =i? =) 
Thus, the Galois group 9Y(K/Q) consists of the eight automorphisms 


e, o, a”, o°, t, ot, o*t, o°t which satisfy the relations o* =e =t? and 
tot = a°. Thus [K: Q] = 8.'A basis for K over Q is 
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fi C27 (A/2)9: 9, 34/2 /2P /D). 
The element ot of G(K/Q) has order 2, since we have 

(ot)(ot) = o(tot) = 00° =e. 
Therefore {e, ot} is a subgroup of Y(K/Q). We shall determine the fixed field 
of this subgroup. An element & € K may be written 

eee cot 2 C52 /2)? 404(4/2)? 
sis ieticad <2 tea) ce (2/2)? 

We can compute oté directly as 


ot =C, + Ci 4/2 = BeDy — C4 i(¢/2)3 
= csi + €,4/2 Me, He /2)7 =c,(4/2)°. 
If € belongs to the fixed field of {e, ot}, then ot = € and we must have 
C2 = C6, C3 = —C3, C4 = —Cg, Cs = —Cs, OF in other words, c; = 0 =c, and 
E = cy + (1 + i)9/2 + cy 19/2)? + cali — 18/2)? 
mee, Hic +1)2/24 45(1 +.1)7(2/2)7 4 Foal 4 )7(4/D)*. 
It follows that the fixed field of {e, ot} is Q((] + ne) 2): This is not a Galois 
extension of Q, since {e, ot} is not a normal subgroup of ¥(K/Q). 


133a. In the example above, justify the implication that e, o, 0”, o°,t, ot, 
o?t, and o°*t are automorphisms of K that leave Q fixed. 


133B. Determine the Galois group 9(£/Q), where E is the splitting field over 
Q of x4 + x? -6. 


133y. Find the Galois group of the smallest Galois extension of Q containing 
/2 + 3/2. 

1335. Determine the Galois groups which may occur for splitting fields of 
cubic equations over Q and give an equation for each case. 


134. The field of n-th roots of unity. Let E denote the splitting field over Q of 
x" — 1. The roots of x” — 1 are the complex numbers 


1k te ees ale 


where ¢ = e*""". These roots form a group themselves—the group K,, des- 
cribed in 44. Since E is a splitting field, it is clearly a Galois extension of Q. 


2ni/n 
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Furthermore, { is a primitive element; that is, E = Q(¢). Consequently, each 
automorphism @ of the Galois group Y(E/Q) is completely determined by its 
value on ¢. Since ¢ can only permute the roots of x" — 1, we must have 
ot = ¢* for some k such that 1 < k <n. Not every such k will do: if (k, n) =d 
and d> 1, then 


PCr’) = (Cr/)E = (AY" = 1 = G(1) 


and ¢ could not be one to one. Therefore if ¢ € Y(E/Q), then f = ¢*, where 
(k, n) = | and ¢* is a primitive n-th root of unity (44). There are ¢(n) primitive 
n-th roots of unity (where ¢@ denotes the Euler totient function of 25). It 
follows that Y(E/Q) can contain at most ¢(n) distinct automorphisms, andasa 
result [E: Q] < ¢(n). To see that [E: Q] is exactly @(n), we prove the following 
theorem. 


Theorem. A minimal polynomial over Q for € = e?"" has every primitive 
n-th root of unity as a root. 


Proof. We may factor x” — 1 as a product of polynomials which are irre- 
ducible over Q and which have integral coefficients (106). One of these factors, 
call it f, must have { as a root. Since/ ‘is irreducible, it must be a minimal poly- 
nomial for € over Q. What is more, because f and all the other factors have 
integral coefficients, it follows that f is monic (has leading coefficient 1). We 
note for future reference that this implies 


fx = (x — @)(x — ©) ++ & — @), 


where @,, @2, ..., @, are the roots of f. 

Let f, denote the polynomial over Z given by f, x = f(x*). Since fis monic, 
we may invoke the division theorem for polynomials over Z (99a) to write 
jf, uniquely asf, = 9, f+ 7,, where qg, and r, are polynomials over Z and either 
deg r, < deg f or r, =0 (which means that f|/,). Next, we observe that r, 
depends only on the congruence class of kK modulo n. Indeed, if k = / mod n, 
we have ¢* = ¢', and consequently, 


FOE) -/O =f -fib =0 


and ¢ is a root of f, —f;, from which we conclude that /| (f, — f,) andr, =r;. 
Therefore each r, equals one of the polynomials r,,r2,...,7,- 

Let v be a natural number exceeding the content (106) of all the polynomials 
iy 2>-+-++;%,- Then it follows that r,=0 if there exists a natural number 
p > vsuch that p|r, (p divides each coefficient of r,). Now we claim: whenever 
p is prime and p > v, then p|r, and therefore r, = 0, in other words, f|f,. To 
establish this claim, we first remark that p|(/, — /?) where f? is f raised to 
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the p-th power. To put it another way, f, = /? + pA where A is a polynomial 
over Z. (Why ?) We may write A uniquely as A = Ef + p where deg p < deg for 
p =0. Now we have 


Sp = %pf +1, =(f? * + Po)S + pp, 


and by uniqueness it follows that r, = pp. Since by hypothesis p> v, p|r, 
implies r, = 0 and f|f,. We have proven our claim. 

Now we are able to show that (k, 1) = 1 implies C* is a root of f£. We would 
be finished if we knew there were a prime p such that p > vand p=k mod n. 
For then f|f, would imply that ¢ is a root of f, or equivalently, f,¢ = /(¢?) = 
S(C*) = 0. As a matter of fact, by a theorem of Peter Gustav Lejeune Dirichlet 
(1805-1859), such a prime will always exist, but we are not able to give the 
proof of this theorem, which is difficult. Fortunately there is an elementary 
argument which avoids this point. 

Let P denote the product of all the primes less than or equal to v except those 
dividing k, and set | = k + nP. Certainly /= k mod nand ¢' = ¢*. Furthermore, 
primes dividing / must all be larger than v since primes less than or equal to v 
divide either k or nP, but not both. As a result 


[= P,P2°** Ds, 


where p; is prime and p; > v fori =1, 2,..., s. By our previous argument we 
know that /|f,, for each p; in the factorization of /. Since ¢ is a root of fand 
Sf \fp,, it follows that ¢ is a root of f,, or that f, (C) = /(¢?') = 0. Since we now 
have C?' is a root of fand /|f,,, it follows that C?' is a root of f,,, or that 


Sole?) = $0") = 0. 


In s steps of this kind, we obtain (?'?2"" Ps = ¢' = ¢* is a root of f and the 
proof is complete. 


Corollary. If E is the field of n-th roots of unity over Q, then [E: Q] = ¢(n) 
and the Galois group GE/Q) is isomorphic to Z,,. 


Proof. It is clear from the theorem and the discussion preceding it that 
[E: Q] = ¢(n). An isomorphism ¥(E/Q) — Z; is given by ¢ > [k], when ¢ is 
determined by $f = C*. If d€ = C« and WC = ¢', then 


(PW)E = PCW) = H(C') = OM, 


and consequently, dw — [k/], = [k],[/],, which verifies that the mapping is a 
homomorphism. It is evident that this homomorphism is also a one-to-one 
correspondence. 
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It follows from the theorem that a minimal polynomial for € over Q is 
given by multiplying together all the factors x — ¢* where (k,n) =1 and 
1 <k <n. This polynomial, denoted ®,, is called the n-th cyclotomic poly- 
nomial (cyclotomic means “circle dividing ’’). ®, is a monic polynomial which 
is irreducible over Z and deg ®, = ¢(n). Gauss was the first to show irredu- 
cibility of ®, for p a prime. Many proofs of irreducibility for this special case 
(see 107B) and for the general case have been found. A detailed survey of those 
given up to 1900 may be found in Ruthinger, Die Irreducibilitatsbeweis der 
Kreisteilungsgleichung, (Inauguraldissertation, Kaiser Wilhelms Univer- 
sitat, 1907). The proof of the theorem above is an adaption by Artin of an 
argument of Landau which appears in Vol. 29 (1929) of the Mathematische 
Zeitschrift. 


1340. Show that C* is a primitive (n/d)-th root of unity if and only if (k, n) = d. 
Apply this to prove that 


x"—1= || Ox). 
d|n 


1348. Use the formula of 134a to compute ®,(x) for 1 <n < 10. 


134y. Let mandn be natural numbers such that every prime p dividing m is 
a divisor of n. Prove that ©,,,(x) = ®,(x”). Use this to compute ®,,, ®3,, 


1345. Let E = Q(o) where € = e?"'" and n is odd. Show that E contains ail 
the 2n-th roots of unity. 


134. Prove that ®,,(x) = ®,(—~x) when n is odd. 
134. Show that for any n-th root of unity, 


n when wm = I, 


2 he ni > 
aa ae ae (0 weet 


134n. Determine [Q(cos 2zr): Q] for re Q. 


1348. Prove that the cyclotomic polynomial satisfies 
®,(x) = ess To pi 
d|n 


where p denotes the Mobius function (258). 


1341. Using 134, y, &, and @ and the results of 1348, compute ®,(x) for 
1) = 36. 


135. Theorem. A regular polygon ofn sides is constructible with straightedge 
and compass if and only if $(n) is a power of 2. 
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Proof. We observe that the construction of a regular polygon of n sides is 
equivalent to the division of a circle into m equal arcs. Such a division of the 
unit circle in the complex plane is equivalent to the construction of the n-th 
roots of unity. Thus, a regular polygon of n sides is constructible with straight- 
edge and compass if and only if the splitting field E over Q of the polynomial 
x" — lis aconstructible field. By 120 E is constructible only when the number 
[E: Q] = ¢(n) is a power of 2. 

On the other hand when ¢(n) = 2", the Galois group (E/Q) = Z/ has order 
2" and by 74 is a solvable group. Explicitly, there is a composition series 


{e} = Gyo © G, c+ CG, = HE/Q) 


in which o(G,) = 2'. We let E; denote the fixed field of E under G,_;. Then we 
have a tower of fields, 


OF ££, o-°r'e b, =; 


in which each term is a Galois extension of any of the preceding terms. 
Furthermore, 


fe: E,2;)= (GE, ¢7-;: G, =] = 2. 


A finite induction, using the fact that a quadratic extension of a constructible 
field is constructible (See 120), shows that E is constructible. 


136. The preceding theorem leads us to determine the values of n for which 
the Euler totient function ¢(n) is a power of 2. It follows from 25 that n must 
have the form 2'p,p,--: p, where p,, p2,-.--, Py are the distinct odd primes 
dividing n and where ¢(p;) is a power of 2 for i= 1, 2,..., k. Since $(p;) = 
Pp; — 1, we can reduce the problem to the question of finding all primes of the 
form 2” + 1. 

We note that 2”+ 1 is prime only if m itself is a power of 2. In fact if 
m = uv where v is an odd number, then we have 


Pee athe AY SO ee 4 1), 


In other words, m cannot be divisible by any odd number and must be a 
power of 2. Our problem is now reduced to the question of finding all the 
primes of the form 2?" + 1. Fermat (1601-1665) conjectured that all the 
numbers 27° + 1 are prime, and such numbers are frequently called Fermat 
numbers. For q < 5 the number 2° + 1 is prime, but in 1732 Euler (1707-1783) 
discovered that 


27> + 1 = 641 x 6,700,417. 
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For no value of q above 4 is 27* + 1 known to be prime, and for many values 
it is known not to be. 


Table 7 
q Os feng Ged ar ile a 5 
27° solved Pe 5, | a? | 7287 | 65,537 | 4,294,967,297 


In summary, we can say that a regular polygon of n sides is known to be 
constructible whenever n = 2" s@ s{' --- s4* where r>0, ¢;=0 or 1, and 
s,=27'+1,i=0, 1, 2, 3, 4. 

The constructions of the equilateral triangle (7 = 3) and the regular penta- 
gon (n= 5) were known to the ancient Greeks. The construction of the 
regular heptadecagon (n = 17) is a discovery of Gauss, who requested that a 
regular heptadecagon be inscribed on his tomb. The construction of the 
regular 257-gon was carried out by Richelot in 1832. Professor Hermes of 
Lingren devoted ten years of his life to the construction of the 65,537-gon. 
His extensive manuscripts reside in the library at Géttingen. Although many 
valuable works were destroyed in the flooding of this library, a result of 
bombings during World War II, Professor Hermes’s work was untouched. 


1360. List the regular polygons of 100 sides or less which are constructible 
with straightedge and compass. 


136B. Suppose that the regular polygons of m sides and u sides are construc- 
tible with straightedge and compass. Prove that a regular polygon of [m, n] 
sides is constructible. 


137. The Regular Pentagon. As a concrete illustration of the preceding 
articles, we take up construction of the regular pentagon. 
Let E denote the splitting field over Q of the polynomial 


x —1=(x-)NOttx24+x7 4x41). 


E = QS) where £ = e?"/5 = cos 72° + i sin 72°. The Galois group G(E/Q) isa 
group with [E: Q] = 4 elements o,, 0,, 03,04, each of which is completely 
determined by its value on ; 0;6 = C'. The group Y(E/Q) is actually a cyclic 
group of order 4 generated by o, . If we write o for o,, thena? = 04,0° =03, 
and o* = ,, which is the identity. In other words, 4(E/Q) = {1, a, a”, 0°} 
where of = C7. 

The only proper, nontrivial subgroup of G@E/Q) is the normal subgroup 
H = {1, 07}. Consequently, if B is a field between Q and E, then by the 
fundamental theorem of Galois theory, G(E/B) is a subgroup of Y(E/Q), and 
in this case it must be that G(E/B) = H (unless B = Q or B = E). This means 
that we can determine B as the fixed field of the group H. 


Galois Extensions 125 


The numbers (, 67, 63, (* form a basis for E over Q. (Why?) An element 
a=alt+a,l?+a,0+a,¢* of E belongs to B if and only if 2a =«. We 
compute 


Pa=alr+t+a,C+t+az,l? +a,6. 


Thus, o?a = « if and only if a, = a, and a, = a;.In other words, « € Bif and 
only if « = by, + 6.4, where n, =€+(C* and ny, =¢? + C?. We note that 


mMem=C40404 05-1, 
mn2= (C+ O60 4 OE) = 04044040 = -1. 


(This follows from the fact that ¢ is a root of x* + x? + x? + x + 1, and con- 
sequently, (* + €° + (7 +¢+1=0.) We now see that y, and y, are roots of 
the polynomial over Q, 


(x — m)(x — no) = x? — (my + 2)x +n, =x? 4-1. 


Solving this quadratic and noting the position of the roots of unity ¢, (7, 6°, ¢* 
on the unit circle (Figure 14), we see that 


= ~444,/5 =2 cos 72°, 
iD = -4-4,/5= —2 sin 72°. 


It follows that B = Q(,/5). For the sake of completeness we observe that (is a 
root of the equation x*— 4, x +1=Oover B. , 


Figure 14 


Let us see how we may use this information to construct a regular pentagon. 
To construct ¢, it is sufficient to construct cos 72° = y,/2. This is easily 
accomplished as shown in Figure 15. 
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Figure 15 


Beginning with the unit circle centered at the origin O = (0, 0) and the four 
points A = (1,0), B=(0, 1), C=(—1,0), and D =(0, —1), the midpoint 
E =(0, —1/2) of the radius OD is constructed. The line segment AE has 
length O/5/2 Next, the point F on OB is constructed so that EF has the same 
length as AE. Then the length of OF is 


m = -44+4,/5. 


The point G = (cos 72°, sin 72°) is determined by the perpendicular to OA 
through F’, a point whose distance from O is cos 72° = n,/2. (In a simpler 
construction one observes that AF has the same length as one side of the 
pentagon.) 


137%. Analyze completely the Galois extension E = Q(¢) where ( = e?"!/’; 
determine all intermediate fields, whether or not they are Galois extensions of 
Q, and all the Galois groups involved. 


137B. Construct a regular polygon of 15 sides. 


138. The Regular Heptadecagon. The construction of the regular hepta- 
decagon follows the same pattern ds the construction of the regular pentagon. 
However, the additional complexity is enlightening, and we shall sketch the 
algebraic preliminaries in this article. 

Let E denote the splitting field over Q of the polynomial 


x!? —1=(x—1)Q'o+ x + 


The roots of x!’?—1 are the complex numbers 1, ¢,¢?,...,¢1°, where 
C=e7/17 F = QO) and [E: Q] = 16. 
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The Galois group Y(E/Q), which is isomorphic to Z,,;, may be represented 
as a set with 16 elements, ¢,, 62, -..-, 6,46, where @; is the automorphism of E 
determined by ¢;¢ = C'. This group is a cyclic group of order 16 generated by 
3 as we shall see. From now on we shall denote ¢3 simply by @. Since 
ot = 63, we have 


($7) = $(90) = $0) = (GP = CP =O, 


and, in general, ($‘)¢ = C>'. However, (17 = 1 and the value of ¢* depends only 
on the congruence class of kK modulo 17. At this point we need a table of 
powers of 3 modulo 17. 


Table 8 
OMe Sh 4a S 6 |) o7 ie 8 eS 10] 11 p12 Wis | 144 1S 


3'(mod 17)}1 |} 3} 9 |10| 13] S5|15]11]16]14] 8] 7] 4/12] 2] 6 


From this table we can see that f° = ¢,, 6! = $3, ¢? = ¢o, and so forth; and 
we can verify that the powers of ¢, that is, 1 = 6°, ¢', ¢”,..., 61°, run 
through the set ¢,, 62, ..., Gig. It also serves to interpret @; as a power of ¢. 


In what follows we let £; denote ¢'¢ = C3. We note that 
ot, = 0'(9'D) = b'*0 = 6,45. 


This fact is very convenient for making computations. (Table 8 may be used to 
convert C, to a power of ¢ and vice versa.) 

The group &(E/Q) = {1, ¢, ?, ..., ¢'°} has three proper, nontrivial sub- 
groups, each of which is normal, since Y(E/Q) is abelian. These subgroups are 


G, = {1, ¢°} order 2, 
G, = {1, o*, 6°, ¢'7} order 4, 
Gy= {1, ¢°,.¢*, ..., 6° order 8. 
In fact, the series 
iW) = @.cG, 6G, <G,66y= W£/Q) 


is the one and only composition series which Y(E/Q) admits. 

By the fundamental theorem of Galois theory there are three intermediate 
fields: the fixed fields B,, B, , and B, of G,, G,, and G3, respectively. We have 
a tower of fields 


Wes, 28,62, @ E. 
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We note that the complex numbers ¢, ¢?,..., ¢1° form a basis for E over Q, 
or in other words, the numbers C,, ¢,, ..., 6,5 form a basis. Furthermore, 


CoG Meee SI Et Cpe Maral tis = ale 


(Why?) With these facts at our disposal we are ready to determine the fields 
B,, B,, and B,. 

B, is the fixed field of G; = {1, 6, $*,..., 6'*}. Since G; is a cyclic group 
generated by 2, it follows that a € E is fixed by every element of G, if and 
only if it is fixed by #2. In other words, 


B, = {ae E| da =a}. 
Using the basis Cy, £,,..., 6,5, we write 
@= alg t+ arty +o tars cis, a,€Q. 
Then 
7a = aol, +ayls +77 +. O35 bis + Aygo + Aish. 


(Note that $70,4=646=©'° =C =C, and $76,, =(,.) Now we see that 
$72 =a and we B; if and only if 


Ap =A, = Ag =" =A, 
and 

Oe ee 
Thus, a € B; if and only if « = ag + ayn, where ay, a, € Q and 
No = lo tla tla ti tl HCH Ot CP + OP 4 8 4 8 4 eh 4 
mat testes tt ls HOt Oe C4 Ot 4 C4 OT 4 + OF. 
Now 

NMotm=lCoth to +05=-—1. 


Furthermore, 751, is left fixed by 6? and must be expressible as ayn) + an, 
for some dy), a, €Q. Multiplying the expressions for yo and n, yields 


Nom = 4No + 4m, = —4. 


Thus, n,'and n, are roots of the polynomial x? + x — 4, and B, = Q(n) where 
n is either root yo or n, of the polynomial. Of course we can solve the equation 


x? + x —4=0 to determine 
y= jl ty 17 
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from which it follows that B;= Q(/17). The elements yo and n, are called 
periods of length 8, since each is a sum of eight of the roots C5, ¢,, ..-5 C15. 

The fields B, and B, are determined in a similar fashion. A basis for B, over 
Q consists of the four periods of length 4, 


fCo=lCotlatla t+ lia, 
CHO tls tlot fis, 
G3 = Ga + Ge Cio + Cia 
C303 t+07 +0 +s. 


Direct computation shows that the following relations hold among the ¢;: 


fo + 62 = No» Gi +¢3 =m, 
Coo2 = —1= C163, 

€1 = (fo — I/(Eo + 1), 

Qo = (E, = Vita 1) 

€3 = (62 — IME. + 1d, 

fo = (63 — 1)/(E3 + 1). 
Thus, &, and €, are roots of the polynomial x? — ny x — 1 over B;, and ¢, and 
&, are roots of x? — y,x — 1. Consequently, B, = B;(€) where € is a root of 
x? — yx -1. 

A basis for B, over Q consists of eight periods of length 2: 


Ag =lot bs, Ag=Ca+ Cir; Ao + Ag = C0, 
A, =C, + fo, As=Cst+Ci3, Aytas =, 
A, =624+ Cro; Ag =bo t+ Cia, A, +46 =62; 
Ag =C3 4+ Ci, A, =0,4+%5, Ax +4, = 63. 
At this stage it is easy to show how to multiply these periods: 
Ag Ag = (Co + Sala + S12) = (0 + 2°08? + C4) 
= (14 4 75 4 729 4.20 
afi* anf? 4 p24 0 
=lCotlstlis tli 
= €,. 
Similar computations show 4,4, = €,, 424, = €3, and A,A, = &. Conse- 
quently, 
A, and A, are roots of x? — 9x + , over Bz, 
A, and 4, are roots of x? — €,x + €, over B,, 
A, and A, are roots of x? — €,x + &; over Bp, 
A, and A, are roots of x? — 5x + & over Bp. 
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Using the relation between €, and & above, we see that A) and A, are roots of 
(9 + 1)x? — (5 + IEox - (Go =A) 


We conclude (without giving details) that B, = B,(A) where 2 is a root of the 
polynomial 


(Sep le aC cx eral 


over B,. 

Finally the same kind of analysis shows that E = B,(C) where € is a root of 
x? —Ax + 1 over By. 

In summary we have worked out the following relations between the fields 
OB, WR Be TE: 


E = B,(0) where € is a root of x* — Ax + 1 over By, 
B, = B,(A) where A is a root of (€ + 1)x? — (€ + 1)éx —  — 1) over B3, 
B, = B,(é) where ¢ is a root of x? — nx + 1 over B3, 
B, = Qn) where 7 is a root of x7 + x — 4 over Q. 


We could use this information to formulate a geometric construction of the 
regular heptadecagon, but there is little interest in actually doing so. Many 
constructions are available to the reader. (See Eves, A Survey of Geometry, 
Vol. I, p. 217, or Hardy and Wright, An Introduction to the Theory of Numbers, 
Pp. 57.) 


Solvability of 
Equations by Radicals 


139. Let f be a polynomial over a number field F. The equation fx = 0 is 
solvable by radicals if all the roots of fcan be obtained fromelements of F by a 
finite sequence of rational operations (addition, subtraction, multiplication, 
and division) and extractions of n-th roots. 

For example, the sixth-degree equation over Q, 


x® — 6x* + 12x? —15 =(x? — 23 -7=0, 


is solvable by radicals. In fact all six roots may be expressed as Yee 7 
provided we interpret a), as any of the three cube roots of 7 and 2 + 2/7 as 
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either of the square roots of 2 + SF. Quadratic, cubic, and quartic equations 
are solvable in radicals. 

In 1799 Paolo Ruffini (1765-1822) tried to prove the existence of quintic 
equations not solvable in radicals. The argument Ruffini gave was inadequate, 
and the question was settled decisively by Abel in 1824. Galois gavea necessary 
and sufficient condition for solvability of an equation of any degree by radicals, 
which dramatically supersedes the work of Abel and Ruffini. 


140. The simplest and most clear-cut case of an equation over a number 
field F which is solvable by radicals is the equation x” — a = O where ae F*. 
We have already examined the special case F = Q and a = 1 in 134. Now we 
take up the general case as preparation for the Galois criterion for solvability 
by radicals. 

Let E denote the splitting field of x” — a over F. If B is a root of x” — a, then 
the other roots are BC, BC?, ..., BC" 1, where € = e?*'/". Since B and BC belong 
to E, it follows that € = (BC)/B belongs to E, and therefore E contains all the 
n-th roots of unity, 1, ¢, (?,..., ¢"~'. Clearly, E = F(C, ). 

E is a Galois extension of F by 132, and since E = F(, B), each element of 
the Galois group Y(E/F) is determined by its value on the two elements { and 
B. If 6 € GY(E/F), then ¢ must carry £ to (* where (k, n) = 1. (This is shown by 
the argument of 134.) On the other hand, ¢ can only permute the roots P, ¢, 

., BC"! of x"—a, so that #(f) = BC'. Thus, the two numbers k and / 
determine the automorphism ¢ completely. In general only certain values of 
k and / will give elements of G(E/F). 


140a. Determine the Galois group Y(E/F) where E is the splitting field over 
F of x® — 8 for the cases when F = Q, F= Q(,/2), and F = Q(w) where 


e2tt/3 — —$44,/-3. 


141. Theorem. If E is the splitting field of the polynomial x" — «a over a 
number field F, then the Galois group G(E/F) is solvable. 


Proof. By the analysis of 140, E = F(C, B) where € =e?" and £ is a root of 
x" —a. Let B= F(C). Then B is the splitting field of x" — 1 over F and is a 
Galois extension. By the fundamental theorem of Galois theory (130) we 
know that G(E/B) is a normal subgroup of GY(E/F) and that G(B/F) is the 
quotient group. Y(£/B) contains just those automorphisms of G(E/F) which 
leave € fixed. In terms of 140, ¢ ¢ G(E/B) when k = 1. Consequently, an 
automorphism @¢ € Y(E/B) is completely determined by the number / where 
$(B) = BC'. In fact the assignment ¢++ / identifies Y(E/B) with a subgroup of 
the finite abelian group Z, . (Why ?) It follows that G(E/B) is a solvable group 
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(75). Next we want to see that G(B/F) may be identified with a subgroup of 
Z;,, and this will show that Y(B/F) is solvable. From the proof of the funda- 
mental theorem (130), we recall that the epimorphism 4(E/F) — O(B/F) is 
given by the assignment ¢++ @|B. However, the restriction ¢|B of any 
¢ € GE/F) is completely determined by the number k of 140. The mapping 
given by (¢| B)r+& identifies G(B/F) with a subgroup of Z,,. Now we have that 
G(E/F) is solvable, since the normal subgroup 4(£/B) and the corresponding 
quotient group 4(B/F) are solvable (75). 


142. A radical tower over F is a tower of number fields 
P= Foe hc: #, 


in which, fori=1, 2,..., , F; is the splitting field of a polynomial x — «; 
over F;_,. Such a tower is Galois if the top field F, is a Galois extension of the 
ground field Fy. 


Proposition. Every radical tower can be embedded ina Galois radical tower. 


Proof. We shall show that given a radical tower over F, 
F=Fyck,c-:'cF,, 
we can construct a Galois radical tower over F, 
F=f, cf us. ere 


such that F, c F,,. We begin by setting F, = F,. Since F, is the splitting field 
of x*! — a, over Fy, by 132 F, is a Galois extension of Fo. (If n = 1, we would 
be finished—the two-story tower Fy ¢ F, is Galois.) Let the Galois group of 
F, over Fy be 


G(F, |Fo) Fe {¢1, p2 : GME) ?,}- 


F, is the splitting field of x** — x, where a, e F, = F, and a, is algebraic over 
F,. Now we let F, be the splitting field of x*? — ¢,0, over F,, F; the splitting 
field of x*? — $, a, over F,, and so forth, down to F,,,, which is the splitting 
field of x*? — d,a, over F,. Now the field F,,, is a Galois extension of F 
because it is the splitting field of the polynomial 


(xt — at, (x on 1%2)(x"? — 202) +++ (x? — Ga), 


all of whose coefficients lie in F. (Why?) Furthermore, a, is among the 
numbers $02, $2 %2,---, @,@, and consequently, F, ¢ F,,,. (If n = 2, we 
are finished.) 
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This argument can be continued, extending the Galois radical tower 
=F ek pene 


to anew one containing F; . In performing this next stage of the construction, 
it is necessary to use the Galois group 4(F,,,/Fo) in place of G(F,/F) and a, 
in place of a, but otherwise the argument is similar. The entire argument 
is iterated until a Galois extension F,, containing F, is reached. 


143. A number field £ is a radical extension of a number field F if there exists 
a radical tower over F, 


Hf fic: F,, 


such that E c F,. In view of the preceding proposition, we may assume with- 
out loss of generality that F, is a Galois extension of F. 


Proposition. If D is a radical extension of E and E is a radical extension of F, 
then D is a radical extension of F. 


Proof. Let 
ES Pes SP 
and 
B—-mclc ee £, 


be radical towers such that Ec F, and Dc E,,. Suppose that E;1is the split- 
ting field of x*‘ — a, over E;_,. Then we let F,,; be the splitting field of x‘ — a, 
Orem 4:2, im Order to define. inductively fields F,,,, Fysay ---> Faam- It 
follows that 


FPEePoCh, CCK, CK 4, Oo C Fam 


is a radical tower over F. Furthermore, for i=0, 1, ..., m, we have 
~c F _,. Hence, D < E,,  F,.,,, and D is therefore a radical extension of F. 


144. Proposition. Ifanumber field E is a Galois extension of a number field F 
and the Galois group GE/F) is cyclic, then £ is a radical extension of F. 


Proof. First we shall prove the proposition for the special case that F con- 
tains the n-th roots of unity, 1, ¢,...,¢” ' form = o(G(E/F)). The general case 
will follow from the special case. 
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By 129 we have E = F(@) for some element @ which is algebraic over F. We 
define the Lagrange resolvent (¢*, 0) € E of @ by C* with the formula 


8) = YH a'e) = 0 + (G0) += + OMEN), (1) 


where G(E/F) = {1, ¢, ¢?,..., @” '}. Next we compute the sum of all the 
Lagrange resolvents of 0: 


=] 


n-1 Cl NT al | n n-1 
Lea= > Yorn y (Lor) =n. — Q) 
k=0 k=0 i=0 i=0 k=0 


The last step in this computation is justified by the observation: 
n, fori=0Omoda, 


n-1 
IN’ i sae EV — (rin 
= Tea Sane BOs) mse), 


Consequently, we have 
1 "721 
o=- Y (8). 
Nk=0 


Now we observe how the Lagrange resolvents behave under the automor- 
phisms of the Galois group Y(E/F). Since ¢ leaves F fixed, we have $(¢') = ¢' 
for each n-th root of unity. Consequently, we can compute directly that 
pct, 8) = (C*, G6) = C-*(%, 8), 
and what is more, 
oct, ay” = (C-)C*, 0)" = (CF, 8)". 


In other words, ¢@ leaves the number «, = (¢*, 0)" fixed, As a result, all the 
elements of Y(E/F) leave fixed each of the numbers a, fork =0,1,...,n—1, 
and consequently these numbers belong to F. Now we can construct induc- 
tively a radical tower 


F=F)cF,c:'cF, 


by taking F;,, to be the splitting field of x” — a; over F;. Then F, contains all 
the Lagrange resolvents of 0, and hence, 


6-1 "S40 
er. : 


belongs to F,. Thus, E = F(@) c F,, and we have shown that E is a radical 
extension of F. 
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The general case, in which F need not contain all the n-th roots of unity, is 
proved from the special case as follows. Since E = F(@) is a Galois extension of 
F, E is the splitting field of a minimal polynomial for @ over F, call it f. The 
field F(¢, 0) is the splitting field of the polynomial g over F given by gx = 
(x” — 1)(fx). By 132, F(C, 8) is a Galois extension of F. It follows from 130 that 
F(C, @) is a Galois extension of F(¢). A homomorphism of Galois groups 


H: WFC, 8)/F(C)) > WE/F) 


is defined by H(W) = wW|E (the restriction of w to E). It is not difficult to 
verify that w is well defined, preserves composition of automorphisms, and is 
one to one. Asa result the group (F(C, @)/F(C)) is isomorphic to Im H, which 
is a subgroup of the cyclic group Y(E/F), and hence, by 43 is itself a cyclic 
group. Therefore G(F(C, 0)/F(C)) is a cyclic group, and its order m (which is 
also the order of Im #) divides n, the order of G(E/F). Thus F(C, 6), as an 
extension of F(Q), satisfies the hypothesis of the proposition: its Galois group 
is cyclic of order m. Furthermore, F(¢) contains the m-th roots of unity, 
1c, c2nm com 1)n/m. because m|n. We have the situation of the special 
case, and may conclude that F(C, 0) is a radical extension of F(¢). Now F(C) is 
quite clearly a radical extension of F, and by 143, so is F(¢, 6). If F=Fyo < 
F, <::: ¢ F, is a radical tower such that F, contains F(C, 6), then F,, contains 
E = F(@), which is a subfield of F(¢, @). Thus E is a radical extension of F. 


144a. Let E be a Galois extension of a number field F, with the property 
that B, c B, or B, c B, for any two intermediate fields B, and B,. Show that 
E is a radical extension of F. 


144B. Let E be a Galois extension of a number field F with Y(E/F) abelian. 
Show that E is a radical extension of F. 


145. Theorem (Galois). Let f be a polynomial over a number field F and let 
E be its splitting field. The equation fx = 0 is solvable by radicals if and only 
if the Galois group GE|/F) is solvable. 


Proof. A moment’s reflection reveals the equivalence of the two state- 
ments: 


(1) the equation fx = 0 is solvable by radicals over F; 
(2) the splitting field of fis a radical extension of F. 


(In fact the second statement is often taken as a definition of the first.) We 
must prove that E is a radical extension of F if and only if G(E/F) is solvable. 
Suppose E is a radical extension of F. Then there is a Galois radical tower 


Furie fc: Ff, 


136 4 Galois Theory : 


such that Ec F,. F, is a Galois extension of F=F), and hence, a 
Galois extension of each F;. Furthermore, each F; is a Galois extension of 
F,-,. Consequently, setting G; = G(F,/F,-;), we have that 


{1} =G@) eG, c++ © G, = HF, /Fo) 
is a normal series. The factors of this series are the groups 


G; igi teai) 
= GRIF) ohn it tl Pai 
G;-1 G(F,/F,~ +1) ( n-i / ) 


by 130, and by 141, Y(F,_;+;/F,-;) is solvable. Thus, Y(F,/F)) has a normal 
series with solvable factors and therefore is a solvable group itself (756). 
Applying the fundamental theorem (130) to the fields Fc Ec F,, we have 


AFF) 


HEF) = 3 aia 


Since @(E/F) is isomorphic to a quotient group of the solvable group 
G(F./F), it follows from 75 that Y(E/F) is solvable. 
On the other hand, suppose that Y(E/F) is a solvable group. Let 


{lI} =G) oG,c::' @G,=AGE/F) 


be a composition series for G(E/F). Let F; denote the fixed field of the group 
G,,-;. Then we have a tower over F, 


F=f, ef pe. ’eF, =, 


Furthermore, ((E/F;) = G,_;. Now G,-; is a normal subgroup of G,_;4,, 
and by the fundamental theorem (130) we may conclude that F; is a Galois” 
extension of F;_, with 


HE/Fi-1) _ Ga-ist 


WF; /F;-1) © GEIF) G 


ai 


Since Y(E/F) is solvable, the group G, _ ;+,/G,-; is cyclic (of prime order), and 
therefore Y (F;/F;-,) is cyclic. By 144 F; is a radical extension of F;_,. 
Applying 143 inductively yields in a finite number of steps that E = F,isa 
radical extension of Fy = F, and the proof is complete. 


145. Let fx = 0 be an equation of degree 6 which is solvable by radicals. 
Prove that fx = 0 is solvable by the extraction of square roots, cube roots, and 
fifth roots only. 


145B. Prove that equations of degree 2, 3, and 4 must be solvable by radicals. 
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146. Quadratic Equations. The simplest possible example of the preceding 
theory is the solution of the quadratic equation 


x? —px+q=0. 


We assume that p andq are elements of a number field Fand that x? —px+q 
is irreducible over F with splitting field E. As a result [E: F] = 2, and 
G(E/F) = {1, ¢} is a cyclic group of order 2. Furthermore, F contains the 
square roots of unity, +1. If «, and a, are the roots of x? — px +q, then 
p=% +a, and q=«a,a,. According to 144, we can solve the equation by 
means of the Lagrange resolvents. We compute: 


(1, %) =a, + ga, =a, +a, =~, 
(—1, a) =a, — da, =%, —&, 
(1, %) = a + Ga, =a, + % =p, 
(—1, 0.) =a, — da, =a, — a, = —(—1, &). 
If we let € = (—1, a,) = —(—1, a), then we have 
a, = ${(1, a) + (1, %2)} =H(p + €), 
Qo = +{(, >) + (=f, a»)} = (p a ¢). 


Now the theory predicts that the squares of the LaGrange resolvents will be 
elements of F. This is obviously true for (1, a,) and (1, «,) because pe F. 
However, we also have 


e — a? — 20,02 + a3 cS (a, ap a)? ay 4a,a, =p — 4q. 


Consequently, € = +,/ p’ — 4q, and finally we obtain 


ay, % =4(pt,/p? — 49). 


147. Cubic Equations. The first case of any complexity among the examples 
of the preceding theory is the cubic equation 


x? — px? +qx—r=0. (1) 


We assume that p, q, and r are elements of F, a number field containing the 
cube roots of unity 1, p, p?. (Since we have that 


p=-t+4/-3 and p?=-4-4,/-3, 


it is enough that F contain «/=3) Let E denote the splitting field of x? — px? 
+qx—r=0. E is a Galois extension of F by 132, and G(E/F) is a per- 
mutation group of the roots, «,, «,, a3 of (1). For the sake of argument we 
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shall suppose that Y(E/F) has all the permutations of «,, «, #3, or in other 
words, that G(E/F) ~ S3 and [E: F] = o(S;) = 6. Of course S; is a solvable 
group: the composition series 


fe) © Aves; 


has cyclic factors A, of order 3 and S3/A, of order 2. Let B denote the fixed 
field of A, , that is, the subfield of E which remains fixed under all even permu- 
tations of a, a, a3. Now A; is anormal subgroup of S; , and consequently, B 
is a Galois extension of F with G(B/F) = S;/A,; and [B: F] = 2. Clearly, 


A = (a — &2)(% — &3)(a%3 — a) 


is an element of B. Since permutations of «,, a, «, carry Ato +A, they leave 
A? fixed and therefore A? e F. We can compute A? in terms of the elementary 
symmetric functions of a,, «,, “3 using the fact that 


Oy +a,+063 =P), 
yA. + H2%3 +030, = 4, 


A> a3 =P. 
In fact we have, after a lengthy computation (given below), 
A? = —4p3r — 27r? + 18pqr — 4q° + pq. (2) 


Clearly, B = F(A) and every element of B can be written in the form u + vA 
where u, v € F. Now E is a Galois extension of B with Galois group Y(E/B) ~ 
A,, acyclic group of order 3. The Lagrange resolvents for «, are given by 


G. dt) = Oy a a ar a3 = P, 
(p, 1) = % + pa, + pas ) (3) 
(p?, 1) =o, + p70. + pas. 
(We have assumed the choice of a generator for GY(E/B) which cyclically 


permutes a,, &2 , 3.) The cubes (p, «,)° and (p?, «,)? are elements of B which 
we compute as follows: 


(p, %)? = af + 03 + 03 + 3p(atar + a5 a3 + 05 0%) 
+ 3p7(103 + 2 a2 + 03.07) + 60402 03 
= (a, + a2 + a3)? + (3p — 3)(@fa, + af a3 + 05 0%) 
+ (3p? — 3)(aya3 + a2 a3 + a3 a7) (4) 
=p? — (ofa, + of a + a3 x3 + a2 05 + 05 oy + 04403) 
+ 3./-3(a20, — aa, + oa, —a, 02 + oF a, — o,03) 


(p, «1)° = p® — (pq — 3r) — 3,/-3A. 


| 
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Similarly, we may compute 


(p?, %)° = p? — $(pq — 3r) + 3,/—3A. 
Next, we note that 
(0, &4)(9?, %) = of + 03 + 03 + (p + p* (aya, + a2 H3 + &3 0%) 
z 
= p* — 3q. 


Finally to write the solutions of (1), we let €,; be-any one of the three cube 
roots of 


p? — (pq — 3r)—3./-3A 
and determine €, by 
0, =p” — 3¢. 
If we set (p, «,) = ¢, and (p?, «,) = €,, then we have 
(1, 03) — (1, o>) = (1, a) = /%; 
(0, @2) = p*(p, a) = p*é,, 
e. 2) = p(p?, 1) = pé2, 
(p, 3) = p(p, 01) = pei, 
(p?, #3) = p*(p, %) = p7E2, 


and consequently, 


a =Hp+e, +22), 
a, =4(p+ p*é, + pe), (5) 
a3 =4(p+ pt, + pe). 

Although our argument was motivated by the assumption that Y(E/F) = S;, 


all the computations involved are completely general, and therefore the equa- 
tions (5) represent the solutions of the general cubic equations (1). 


Remarks. For the special case in which p = 0 in (1), the formulas of (5) 
become much simpler: 


3/r ee . 317 r? 
Gy, 02,4, ™ cg 4°77 57 4°37’ 


where the cube roots are varied and the product of the two terms is always 
— q/3 for any root. This equation is known as Cardan’s Formula. The general 
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case can always be reduced to this special one by the substitution x = X + p/3 


in (1). 


Computation of the Discriminant, A*. The quantity 
A? = (a, — a2)"(a2 — a3)7(a%3 — a)? 


is called the discriminant because it vanishes whenever two of the roots 
1, &2, 3 are equal. To derive formula (2) above, we observe that A can be 
expressed as Vandermonde’s determinant: 


Louies 1 a, a2 


A = det |o,—a,, |= det] a Saale 


az a2 a2 | a, a4 


and therefore A? can be expressed as the determinant of the product of the 
two matrices. In other words, 


To Ty 2 
A? = det TM, j%™MU, 1% 
Tz %M, My 


= Mo M2 M4 + 20yN2 M3 — ME — TM) HZ — 17 T,; 
where 2; = a! + a} + a. Now we have (131 and 131), 


TM =1+1+1=3, 

TM, =% +2 +43 =—p, 

Tl, =a? + a2 + 02 =p? — 2g, 

Tz, = 07 +02 + a3 =p? — 3pq + 3r, 

Tl, = at + a3 + a5 = p* — 4p’q + 4pr + 29”. 
Substituting these values into the expression above for A? will yield equa- 
tion (2). 
147. Verify that the substitution x = X + p/3 in 

x? — px? +qx—r=0 

yields an equation of the form 


X?+QX-R=0. 


147B. Derive Cardan’s formulas from the solution of the cubic given by the 
formulas (5). 
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147y. Use the method of 147 to solve the following cubic equations over Q: 
xe — x? -—x+2=0, 
x? — 6x? + 1lx —6=0, 
xe+x+3=0. 


1475. Devise cubic equations over Q whose Galois groups have orders 1, 2, 
3, and 6. 


147s. Prove that a cubic equation over Q has three real roots if A? > 0 and 
one real root if A? < 0. 


147¢. Let f be a polynomial of degree 3 irreducible over Q. Prove that the 
splitting field of f is Q(A, «) where A? is the discriminant of f and « is one of 
its roots. 


147n. Show that a cubic equation irreducible over Q with three real roots 
cannot be solved by real radicals alone. 


148. Quartic Equations. Let E denote the splitting field of the quartic 
equation 


x* — px? +qx? —rx+5=0 (1) 


over F, a number field containing the cube roots of unity. Just as with the 
quadratic and cubic equations, we shall assume that Y(E/F) contains all 
permutations of the roots a,, «,, «3, a, of (1), or in other words, that 
WE/F) = S, and [E: F] = 0(S,) = 24. 

S, has A, as a normal subgroup, and A, has a normal subgroup N con- 
taining the identity e and (12)(34), (13)(24), (14)(23). NM is abelian, and 
therefore the subgroup K containing e and (12\(34) is normal. Thus, S, has 
a composition series with cyclic factors: 


{e} GOK GNCAVES,. 
We take B,, B,, and B; to be the fixed fields of A,, NV, and K, respectively. 
As in the case of the cubic equation, B, = F(A), where 
A = (a — a )(a, — &3)(a, — %4)(~%. — &3)(%2 — H4)(X3 — &4), 
and A? € F. We shall not need to compute A?—it will fall out in what follows. 
The element 6, = «,a, + «3,4 Is left fixed by NM and by the permutations 
(12), (34), (1324), and (1423). Hence 0, ¢€8B,. All the permutations of S, 
applied to 0, yield only the numbers 
6, =0,0, +030,, 
0, = 0,04, +0204, 


and 03 = a4, +0203. 
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Moreover, the elements of N are the only permutations leaving fixed all 
three numbers 0,, 82, and 03. Consequently, B, = F(6,, 02, 03). (Why?) 
Furthermore the polynomial 


(y — 0,)(y — 92)(y — 03) =y? — Py? + OV-—R (2) 


is left fixed by S,, and its coefficients belong to F. Thus, B, is the splitting 
field of (2) over F. (We call (2) the resolvent cubic of (1).) The coefficients P, Q, 
R may be computed as follows: 


P=0, + 02 + 03 = 62(0, &2, 03,04) = 49, 
QO = 0,0, + 0,0; + 030; = pr — 4s, 
R= 0,0,0, = s(p? — 4q) +r’. 


(Details are left to the reader.) The discriminant of (2) is the quantity 


(8; — 92)°(02 — 83)7(83 — 94)? 
= (a, - 03)"(aty — ot4)*(%3 — 4)7(Ory — O2)7*( — 03)"(a. — a4)’, (3) 
which is just the discriminant of A? of (1). The right-hand side of (3) may be 
computed from the formula of the preceding article. (We shall not need A for 
the solution of (1).) Of course the roots 6,, 6, , and 0, of (2) may be obtained 


from the formulas of 147. 
To complete the solution of (1), we set 


C1 = % +4, —G3— a4 = 2a, + a2) ——p, 
2 = Oy — Hy, +43 — O14 = 2m, + 43) ——p, 
C3 = Oy — 2 — 3 +4 = Am, + 04) ——. 
Since the permutations of N either leave fixed or change the sign of each é,, 
it follows that €? € B,. Direct computation shows 
gt =p’—4q+ 40, , 
C3 =p? — 4q + 402, (4) 
Next, we note that 
€15203 = B(Hy + m2 )(y + 3 )(Oy + 4) — 4 P(e, + o2)(O + %3) 
— p(y + H%2)(%, + &4) — 4p(a + %3 (ay + Og) 
+ 2p?(3a, + a2 +03 +4) —p? 
= 8a) + 8(a, +03 + a 4)u? + B(x, 43 + 05%, + Hy O2)a, + 8x,4. 0,5 
— 4p[3a? + 2a. + a3 + ag)oy + 6203 + 5%, + Ha] 
+ 2p?(20, + p)— p° 


—————— 
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= 8pat + 8r — 4plat + 2pay + 9 — m4 (p — %)] 
+ 4p?a, + 2p? — p® 
= 8r — 4pq + p”. 
Finally, we write out the solutions of (1). We take €,, €, , €, to be the square 


roots indicated by (4), in such a way that €,é, €, = 8r — 4pq + p’, and we set 


a =4(pt+e, +2 +3), 
a2 =4(p + ¢, — 2 — &3), 
a3 =4(p— ¢, + 22 — €3), 
a, =4(p—¢, —¢, + €3). 


(5) 


149. Quintic Equations. Unlike quadratic, cubic, or quartic equations, 
quintic equations are not in general solvable by radicals. To show this it is 
enough to give an example. We choose the polynomial 


fx = 2x? — 10x + 5, 


which is clearly irreducible over Q by the Eisenstein criterion (107). Let E 
denote the splitting field of f over Q. The Galois group (E/Q) is a permuta- 
tion group of the roots a, #2, %3, %4, 5 of f (132) and is therefore isomor- 
phic to a subgroup of S,. In fact, we shall show that G(E/Q) = S,. 

First we remark that 4(£/Q) must be a transitive permutation group of the 
roots of f. In other words, given two roots «; and «,, there is some ¢ € Y(E/Q) 
such that $(«;) =a,. If this were not the case, then the polynomial 


gx = (x — a(x — a’) ++ (x ~ a"), 


in which a,,@’,..., @” are the distinct images of «, under Y(E/Q), would be 
fixed by Y(E/Q) and have coefficients in Q. What is more, g would be a 
proper divisor of /, contradicting irreducibility of f- 

By elementary techniques of the calculus, we may sketch the graph of +f, 
which has the same roots as f. From Figure 16 we see that f has three real 
roots, which we call «,, a@,, and a,. The other roots, «, and a, must be com- 
plex and conjugate. 

The automorphism of C which carries each complex number a + bi to its 
complex conjugate, a — bi, simply interchanges «, and «, and fixes a3, a4, 5. 
Consequently, it restricts to an automorphism @¢ of E = Q(a,, #2, 3, @4, @s). 
Clearly, d € Y(E/Q). 

Now we have that Y(E/Q) is isomorphic to a subgroup // of S,, which is 
transitive and which contains the transposition (1, 2). By 86, #7 = S, and 
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eT) 


Graph of 2f = 2x5 — 2x + 1 


Figure 16 


G(E/Q) = S;. By 84 it follows that Y(E/Q) is not solvable. Consequently, the 
equation 


2x? = 10x po — 0 


is not solvable by radicals according to 145. 


149%. Construct a polynomial of degree 7 which is irreducible over Q and 
not solvable by radicals. 


1498. Prove that for any prime p > 3 there exists a polynomial / of degree p 
which is irreducible over Q and not solvable by radicals. 


Ring Theory 


Chapter 5 


A ring is an algebraic structure with two operations, addition and multipli- 
cation, but without all the properties required of those operations in field 
structure. Specifically, it is not required that every nonzero element of a ring 
have a multiplicative inverse. If we think of field structure as an abstraction 
of the properties of the set of rational numbefs, then we should think of ring 
structure as an abstraction of the properties of the set of integers. 

In this chapter we present the elementary theory of rings for commutative 
rings with unity. The main aim of this presentation is the proper abstract 
setting for unique factorization theorems like those for natural numbers (24) 
and polynomials (104). To show that this effort is worthwhile, the theory is 
applied to a special case of Fermat’s last theorem (175). 


ofinition and 
xamples of Ring Structure 


150. A ring is an additive abelian group with an operation (written multipli- 
catively and called the ring product) which assigns to each ordered pair (a, 5) 
of elements of R an element ab of R in such a way that: 


145 
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(1) multiplication is distributive over addition; that is, for any three elements 
a,b,ceR, 


ab+c)=ab+ac and (a+b)c=ac+ be; 
(2) multiplication is associative; that is, for any three elements a, b, ce R, 
a(bc) = (ab)c; 
(3) multiplication is commutative; that is, for any two elements a, be R, 
ab = ba; 


(4) there exists a unity element 1e€R such that la=a=al for every 
element ae R. 


Remarks. It is customary to require only conditions (1) and (2) in the 
definition of ring structure. In this case, an object satisfying all the conditions 
of the definition above is called a commutative ring with unity. Our investiga- 
tion of ring theory will be confined to such objects, and use of the definition 
above avoids tiresome repetition of the phrase “commutative ring with 
unity.” 


150a. Indicate which of the following sets are rings. Unless otherwise speci- 
fied, addition and multiplication are to be interpreted in the usual sense. For 
those which are not rings, specify which property of ring structure fails to 
hold. 


(a) The set of integers, Z. 
(b) The set of even integers, 2Z. 
(c) The set of congruence classes mod n, Z,. 
(d) The set of rational numbers, Q. 
(e) The set of positive rational numbers, Q*. 
(f) The set of real numbers, R. 
(g) The set of complex numbers, C. 
(h) The set of imaginary numbers. 
(i) The set Z(./ —3) of numbers a + b,/ —3 where a, be Z. 
(j) The set F [x] of polynomials in x over a field F. 
(k) The set of polynomials over Z. 
(1) The set of primitive polynomials over Z (105). 
(m) The set of all 2 x 2 matrices with real entries. 
(n) The set of all continuous functions from R to R. 
(o) The set of all power series with real coefficients. 
(p) The set of all rational numbers with denominators not divisible by a 
given prime. 
(q) The power set 2* (14) of a set X, with union as addition and inter- 
section as multiplication. 
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(r) The power set 2* of a set X, with the symmetric difference as addition 
and intersection as multiplication. 

(s) The set &(G) of endomorphisms (60) of an abelian group with addition 
defined by (¢, + $2)9 =(¢,9) +(¢2g9), and with composition as 
multiplication. 

(t) The set of integers Z with addition © and multiplication ® defined by 


a@®b=a+b+1 and a@®b=ab+a+b. 


150B. An algebraic structure satisfying the definition of ring structure except 
for commutativity of the product (statement (3) of 150) will be called a 
noncommutative ring. If R is a noncommutative ring, let a new multiplication 
on R be given by ax b =ab + ba for every pair of elements a, be R. When 
is R with this new multiplication a ring as defined in 150? 


150y. An algebraic structure satisfying the definition of ring structure except 
for the existence of unity (statement (4) of 150) will be called a ring without 
unity. If R is a ring without unity, define addition and multiplication on the 
set Z x R by 


(m, a) + (n, b) = (m+n, a+ b) 
and 
(m, a)(n, b) = (mn, mb + na + ab). 


Show that Z x R is a ring with these operations and has a unity. 


151. The additive identity element of a ring is called the zero (element) and 
denoted 0. The multiplicative identity element is called the unity (element) and 
denoted |. The additive inverse of an element a is written —a. Clearly, 
Oa = 0 and (—1)a = —a for any ring element a. 

A set with a single element has a unique addition and multiplication under 
which it is a ring. Such a ring is called trivial or null. We write R = 0 to indicate 
that a ring R is trivial. In a trivial ring 1 = 0, which is to say, unity and zero 
coincide. A nontrivial ring must contain some nonzero element a, and since 
la =a #0 = 0a, we conclude that | 4 0. To summarize: a ring is trivial if 
and only if unity is zero. 

In a nontrivial ring a nonzero element a may have a multiplicative inverse, 
that is, there may exist an element a! such that aa~' = 1 = a~'a. Such an 
element is called a unit of the ring. Clearly, the set of units in a ring forms a 
group under the ring product, and this group is called the group of units of 
the ring. A nonzero element which has no multiplicative inverse will be 
called a proper element. Thus, the elements of any ring are divided into three 
classes: zero, units, and proper elements. 
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A field is just a nontrivial ring in which every nonzero element is a unit, 
or equivalently, afield is a nontrivial ring without proper elements. Had we 
defined ring structure before taking up fields, this would have been our de- 
finition of field structure. 


15le. Show that commutativity of addition in 150 is a redundant assump- 
tion by expanding (a + 1)(6 + 1) in two ways. 


151B. Show that commutativity of multiplication in a ring is equivalent to 
the assumption that (a + b)? = a? + 2ab + b? for every pair of ring elements 
a, b. 


15ly. Show that the unity element of a ring is unique. 


152. The following conditions on a ring R are equivalent: 


(1) if a,be R and ab = 0, thena=0orb=0; 

(2) if a,b, ce Rand a #0, then ab = ac implies b = c; 

(3) the set R* of nonzero elements of R is closed under ring multiplication, 
that is, a, b € R* implies abe R*. 


A ring which satisfies one, and hence all, of these conditions is called an 
(integral) domain. Obviously, a field is an integral domain. An element a of a 
ring R is a divisor of zero if ab = 0 for some nonzero element be R. We can 
phrase the definition as: an integral domain is a ring without divisors of zero 
(except zero itself ). 

The most prominent example of an integral domain is the ring of integers 
Z. The subrings of the complex number field C form a particularly important 
class of examples, which we shall call number domains. Of course, every 
number field is a number domain. 


Remark. The word subring has the obvious meaning: a subset of a ring 
which is a ring under the inherited sum and product. It does not, however, 
enjoy the same significance in ring theory as the concept of subgroup in 
group theory. The notion of subring is much less important than that of 
ideal, to be introduced shortly (157). 


152a. Show that a unit element of a ring cannot be a zero divisor. 


1528. Show that the product of a zero divisor and any ring element is a 
zero divisor. 


1527. Let aand b be elements of a ring whose product ab is a zero divisor. 
Show that either a or b is a zero divisor. 


1525. Give examples to show that the sum of two zero divisors need not be 
a zero divisor. 
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152e. Let R, and R, be rings. We define addition and multiplication on the 
cartesian product R, x R, by the rules 


(a,, a2) + (6,, bz) = (a, + by, ay + b2) 
and 
(a, 42)(b,, by) = (a,6,, a2 bp). 


Verify that R, x R,. with these operations is a ring. (The ring obtained this 
way is called the direct product of R, and R, and denoted R, x Rj.) Is the 
direct product of integral domains an integral domain? 


153. The Ring of Integers Modulo n. We already know (33) that Z,, the 
set of congruence classes modulo n, is an abelian group under the addition 
[a], + [5], = [a+ 5),. For n = 1, Z, isa trivial group. If > 1 and multiplica- 
tion in Z, is defined by [a],[b], = [ab],, then Z, is a ring as the reader will 
easily verify. From 89 we recall that Z, is a field if n is prime. If 7 is composite, 
say n = ab, then [a],, # 0 # [b],,, but [ab], = [n], = [0], . Thus, for n composite 
Z,, has divisors of zero and is not a field nor even an integral domain. The 
argument of 34 shows that the group of units of Z, is just the group Z,,. 


153a. Show that the ring Z,,,, is isomorphic to the direct product Z,, x Z, 
when (m,n) = 1. (152 contains the definition of direct product, and 165 
the definition of isomorphism.) 


154. The Ring of Gaussian Integers. Let Z(i) denote the set of complex 
numbers a + bi in which a and b are integers. Under the usual addition and 
multiplication of complex numbers, Z(i) is a ring. The elements of Z(i) are 
called Gaussian integers after Gauss, who first studied them as a generalization 
of the ordinary integers. Since Z(i) is a subring of the field Q(i), it has no 
divisors of zero. In other words, Z(i) is an integral domain. Note that the 
units of Z(i) are +1 and +i. 


155. Kummer Rings. Let p be a prime and let 1, (, 67, ..., ¢?-' denote the 
p-th roots of unity with ¢ = e?"!/?, We shall denote by Z(0) the smallest sub- 
ring of C containing all the elements of Z and ¢. It is immediately apparent 
that Z(C) contains all the complex numbers which can be written in the form 


a) +a,0+ ons (6 ia 


where dp, 4@,...,@,-,€ Z. Since Z(C) < Q(C), it follows that Z(¢) is an 
integral domain. It is easy to see that the numbers +1, +f,..., +0?’ are 
units of Z(¢), but at this point it is not easy to see whether there are others. We 
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shall call Z(¢) a Kummer ring after the mathematician E. E. Kummer (1810- 
1893), who studied the problem of unique factorization for these domains. 


156. Polynomial Rings. For any ring R the set of polynomials in the 
variable x with coefficients in R is a ring under the usual addition and multi- 
plication of polynomials. We denote this ring by R[x]. To give a precise status 
to R[x], we shall adopt the following more formal definition. 

Let N denote the set of nonnegative integers {0, 1, 2, ...}. A polynomial f 
over the ring R is a mapping f: NR which has the value 0 for all but a 
finite number of elements of N. We let f, denote the value of f on k EN. (Of 
course we are secretly thinking of f as fo+ fax t+fox? +°°'+f,x" with 
Jj, = 9 for k > n.) Now R[x] denotes the set of all such polynomials over R. 
Addition and multiplication of elements of R[x] are defined by 


k 
GF+9.=h+9 and (f9) = LSiIn-i- 


If fe R[x] and f, =0 for all ke N, we write f=0. If f 40, then we may 
define the degree of f by 


deg f= max{k Ee N|f, 4 0}. 


Finally, we observe that we may identify R with a subring of R[x] by letting 
ae R correspond to fe R[x], where fo = a and f, = 0 for k > 0. 

This definition of R[x] has the advantage that elements of R[x] are defined 
as genuine mathematical entities and not as “‘expressions of the form... ”’. 
It also has the advantage of allowing explicit definitions of addition and 
multiplication. Moreover, it generalizes easily to polynomials in several 
variables: a polynomial in n variables over R is simply a mapping 


f:Nx-(n)-s) xNoR 


which has the value 0 on all but a finite number of elements of its domain. 
On the other hand, this formal definition of R[x] has no real relation to 
the variable x. Another difficulty is that from force of habit we simply do not 
imagine a polynomial over R as a mapping from N to R. 
Regardless of the manner in which R[x] is defined, we may define the 
polynomial ring over R in two or more variables inductively by R[x,, x2] = 
(R[x,])[x2], and so forth. 


Proposition. If Ris an integral domain, then R(x] is also an integral domain. 
Proof. If f,g € R[x] and fg = 0, then fog9 = 0 in R. Therefore, since R is 


an integral domain, either fo = 0 or gg = 0 (or both). Suppose that fo = 0. 
Then it follows from 
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(79): =fi90 + fo9: =/iGo = 9 


that either 4, = 0 or gy = 0. Continuing in this manner, we are forced to the 
conclusion that either f= 0 org = 0. 

Alternatively, we might suppose that f; and g; are the first nonzero coeff- 
cients of the polynomials f and g. Then (/9);+; =/;9; #0, which shows that 
tg #9. Thus the product of nonzero polynomials in R[x] is nonzero. 


Corollary. [If R is an integral domain, then R[x, X2,...,%X,] is also an 
integral domain. 


156a. Determine the group of units of R[x]. 


1568. What conditions on R will insure that deg (fg) = deg f + deg g for 
any two polynomials f, g € R[x]? 


156y. The ring of power series R[[x]] over a ring R is the set of a/] mappings 
f: NR with the same rules of addition and multiplication as given above 
for R[x]. Determine the group of units of R[[x]]. 


157. An ideal of a ring R is an additive subgroup a of R with the property 
thatr e Rand ae aimplyra€é a. Clearly, the set containing the single element 
0 and the set consisting of the whole ring R are ideals. An ideal a is called 
proper ifa # {0} anda R. 


157a. Prove that the intersection a 4 b of two ideals a and b of a ring R 
is again an ideal of R. 


157B. Prove that an ideal containing a unit element is the whole ring. 


158. For each element a ofa ring R, the set 
(a) = {xe R|x=ra,re R} 


is an ideal, called the principal ideal generated bya. \t is easy to see that the 
principal ideal (a) is the smallest ideal containing a. In other words, if a is 
an ideal of R and ae a, then (a) c a. We note that (1) = R and consequently, 
1 Ea implies R ca, or what is the same thing, a = R. An element ae R is 
clearly proper if and only if (a) is a proper ideal of R. 


152 5 Ring Theory 


The integral domains in which every ideal is a principal ideal are of 
exceptional importance in ring theory. For brevity we refer to these integral 
domains as principal ideal domains. A field F can have only the improper 
ideals (0) and (1) = F. (Why?) Therefore a field is automatically a principal 
ideal domain. Fields, however, are the least interesting examples of principal 
ideal domains. The primary example of such a domain is Z, the ring of 
integers. To see this, we recall that every additive subgroup of Z has the form 
mZ (36) and observe that mZ is just the principal ideal (m). 


158%. Show that an element a of a ring R is a unit if and only if (a) = R. 
158B. Show that (a) < (b) if and only if a = rb for some r. 
158y. Show that (a) = (6) if and only if a = ub for some unit element uw. 


1585. Prove that in the ring of integers Z, (m) 7 (n) =([m, n]), where 
[m, n] denotes the least common multiple of m and n (23y). 


158. Let a and b be elements of a ring R. Show that the set 
c={xeR|x=ra+t sh; r,s eR} 


is an ideal of R and that it is the smallest ideal of R containing (a) and (6). 


158¢. Let a and b be elements of a domain R. Show that the set 
c= {xe R|axe(b)} 


is an ideal of R. 


158. Prove that every ideal of the ring Z, is principal. Is Z, a principal 
ideal domain? 


1586. Show that for n> 1 the ring of polynomials R[x,, x.,..., X,] Over 
a domain R is not a principal ideal domain. 


159. A euclidean domain is a nontrivial integral domain R together with a 
function, called the norm, 6: R* +N (where R* denotes R — {0}), such that 


(1) for alla,be R*, (ab) = (da)(db), 
(2) for all a,be R* there exist elements g,ré R such that a=gb+r 
where or < 6b or r=0. 


It clearly follows from 21 that the ring of integers Z is a euclidean domain 
setting da = |a| (absolute value). From 99 we see that the ring of polynomials 
F [x] overa field F is a euclidean domain with df = 2°84 for f4 0. Now we 
give a fresh example. 
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Proposition. The ring of Gaussian integers Zi) together with the function 
6: Z(i)* > N defined by 6(u + vi) = u? + v? is a euclidean domain. 


Proof. First, we observe that 
O(u + vi) =u? +0? =(u + vi\tu — vi) = |u + vil?. 
Consequently, for a, B € Z(i), we have 
J(ap) = |B? = |x|? |B]? = (5a)(58). 


Thus, (1) holds. To see that (2) holds, we use a little trick. Z(i) is a subring 
of the field Q(i). Thus, «, B € Z(i) implies a/B € Q(i), that is, a/B = u + vi for 
u,veQ. Let p be the integer nearest u, and q the integer nearest v. Let 
y =p+qie Z(i). Now we have « = yf + p where 


] 
Sp = plu — p) + (0 — a)il? = [BI (u= py? + © 4)") < (5) d8, 
since |u — p| < 1/2 and |v —q| < 1/2. 


Proposition. A euclidean domain is a principal ideal domain. 


Proof. Let b be a proper ideal of a euclidean domain R. Then among all 
the elements of b 7 R* there is (at least) one, say b, for which 6b is a minimum. 
If aebo R*, then a=qb+r for some q,reR. Since r=a—qbeb, we 
cannot have dor < 6b due to the choice of b. Therefore r = 0 and b|a for any 
ae b, or in other words, b < (b). However, (b) < b since b € b, and asa result, 
b = (0). This shows that R is a principal ideal domain. 


Corollary. The ring of polynomials F |x] over a field F is a principal ideal 
domain. 


159a. Show that the ring Z(qw), where 
w =e?t/3 $4 4./-3 


is a euclidean domain with 6(a + bw) = a? — ab + b?. 


159B. Let R be a euclidean domain. We shall say that de R is a greatest 
common divisor of the elements a,b eR (not both zero) if c|a and cl|b 
imply that c|d for any ce R. Show that any pair of elements a, b € R (not 
both zero) must have a greatest common divisor d, which can be written as 
ra+sb for some r,se R. (We interpret divisor in the usual sense: r|1 if and 
only if r = st for some s.) 
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159y. Let Z(./—3) denote the ring of complex numbers of the form 
a+b,/—3 where a,beZ. Let 6(a+b./—3) =a? + 3b?. Is Z./—3) a 
euclidean domain? Is it a principal ideal domain? 


1595. Show that an element u of a euclidean domain is a unit if and only if 
d(u) = 1. 


160. The sum of two ideals a and b of a ring R is the ideal 
at+tb={xeR|x=a+b,aea, bed}. 


(It must be verified that the set a + b defined above is an ideal of R, but this 
is routine.) Since every element aea can be written as a+0 and 0¢e 5, it 
follows that ac a+b. Similarly, bc a+b. In fact a+b is the smallest 
ideal of R containing both a and b, which is to say, ac c and bcc imply 
at+bce. 


160a. Prove the following properties of the sum of ideals: 


(a) (a+b) +c=a+(b+0), 
(b) a+(0)=a =(0) +a, 
©) GO) Git ae, 
(4d) a+b=b+a. 


1608. Show that 

an(b+c)>(anb)+(anc) 
and 

a+(bnc)c(at+b)n(a+c) 
for any three ideals a, b, c of a ring R. 


160y. Let (a,,a,,...,a,) denote the smallest ideal of a ring R containing 
the elements a,, a,,...,a,¢€ R. Prove that 


(a,, a2, ..., 4,) = (41) + (a2) + °° + G,). 


1605. Show that, in the ring of integers Z, (a) + (6) = (d) where d is the 
greatest common divisor of a and b. 


161. The product ab of two ideals a and b of a ring R is the smallest ideal of 
R containing all products of the form ab where ae aandbeb. If a,,a,,..., 
a,éa and b,,b,,...,5,¢6, then a,b,,a,b,,...,a,b,¢€ ab, and conse- 
quently, the sum 


2 Gib; = aby + ay by + +++ + ayy (1) 


— 
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is an element of ab. In fact we could define ab as the set of all sums of the 
form (1). (Simple products a,b, are included under the case n = 1.) 


16la. Prove the following properties of the product of ideals: 


(a) a(bc) = (ab)c, 
(b) a(l) =a =(I)a, 
(c) ab=ba, 

(d) abc(anb), 
(e) (a)(b) = (ab). 


161f. Prove that the product of ideals is distributive over the sum of ideals; 
that is, 


a(b + c) = ab + ac 


for any three ideals a, b,c of aring R. 
16ly. Show that a(b nc) < ab cs ac for ideals of a ring R. 
1615. Show that a + b = R implies a m b = ab for ideals a, b of R. 


16le. Ifa and b are ideals of a ring R, then we define their quotient to be 
the set 


a: b= {xe R|xbea for all be b}. 


Show that the quotient of two ideals is again an ideal of R. 


161¢. Show that the quotient operation on ideals in a ring R has the follow- 
ing properties: 


(a) (a: b)b ca, 

(b) (a:b): ¢ =a: (be), 

(c) a:(b+c)=(a:b) (a: 0), 

(d) (anb):c=(a:c)N(b: 0), 

(ec) a:b=R if and only if bea. 


161n. In the ring of integers Z compute the ideals: 


(a) (2) + (3), 

(b) (2) + (4), 

(c) 2)1@)+@), 
(d) (2)(3) 9 (4), 

(e) (2)(3) 9 (2)(4), 
(f) (6) 1 (8), 

(g) (6)(8), 

(h) (6): (2), 

(i) (2): (6), 

Gj) @): @). 
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1616. Let Q[x, y] denote the ring of polynomials over the rational field Q 
in the variables x and y. Compute the ideals indicated, writing each in the 


form (/,,/5,---,Jn). (See 160y.) 


(a) (x) n(y), 

(b) (x+y? =(x+ x+y), 
(Ci Gay), 

@) G)nG.p 

(ce) (x72 + xy) n(xy+y’), 
(f) (x)+Q), 

(g) u@etel) 109), 

(h) (x? + xy\(x- y), 

(i) (7) n (xy) + (”)), 
(j) (x-y)(x) + (’)), 

(k) (xy): (), 

ME sa (6) 
(m) (x): (y), 

(n) (x+y)?:(x), 

(0) (xy): (x, y). 


162. A prime ideal of a ring R is an ideal p such that abe p implies aep 
or be p (or both). A maximal ideal of a ring R is an ideal m, other than R 
itself, such that for any ideal a of R, m ca impliesa=mora=R. 


Proposition. A maximal ideal is a prime ideal. 


Proof. Suppose m is a maximal ideal of a ring R. If ab € m and a ¢ m, then 
the ideal (a) +m contains m as a proper subset. Since m is maximal, 
(a) +m=R. It follows that 1 =ca+m for some ce R and some mem. 
Consequently, b = cab + mb em. This shows that m is a prime ideal. 


162a. Prove that the ideal (m) is prime in Z if and only if n = 0, +1, or |n| is 
prime. 


162B. Prove that every proper prime ideal of Z is maximal. 


162y. Let F bea field. Show that (/) is a prime ideal of F [x] if and only if f 
is constant or irreducible. When is (f) maximal? 


1625. Find a proper prime ideal of Q[x, y] that isn’t maximal. 


163. Proposition. If a,b, and p are ideals of a ring Rand p is a prime ideal, 
then ab c p impliesa < p orb cp (or both). 
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Proof. Suppose that ab c p and a ¢ p. Choose aea — p. For any beb 
we have ab € ab c p. Since p is prime and a ¢ 9, it follows that b € p for any 
beb. Therefore b c p. 


164. Proposition. A ring R is an integral domain if and only if (0) is a prime 
ideal, and a field if and only if (0) is maximal. 


Proof. The first part of the statement is an immediate consequence of the 
definitions of integral domain and prime ideal. For the second, if R is a 
field, then R has only the ideals (0) and (1) = R, and (0) is clearly maximal. On 
the other hand, if (0) is maximal, then for a # 0, (a) = R and ra = 1 for some 
ré R, or in other words, a has a multiplicative inverse a~' = r. Thus, R is a 
field. 


165. A homomorphism of rings is a mapping from the set of elements of one 
ring to the set of elements of another which preserves addition, multiplication, 
and the identity element. In other words, a mapping ¢@: R- R’ is a ring 
homomorphism if R and R’ are rings and if for alla, be R, 


(1) $(@ +) =(¢a) + (5), 
(2) (ab) = (pa)($b), 
(3) (1) = 1’, where | and 1’ denote the identity elements of R and R’. 


A ring homomorphism ¢: R— R’ is called: 


(1) a monomorphism if ¢ is one to one; 

(2) an epimorphism if @¢ is onto; 

(3) an isomorphism if @ is a one-to-one correspondence; 

(4) an endomorphism if R' = R; 

(5) an automorphism if R' = R and @ is a one-to-one correspondence. 


Just as in the case of group homomorphisms, if ¢ is an isomorphism, then the 
inverse mapping @~' preserves addition, multiplication, and the identity 
element, and is again an isomorphism. 


165a. Let ¢: R-> R bearing homomorphism. The kernel of @ is the set 
Ker ¢ = {x Ee R| dx = 0}. 


Prove that Ker @ is an ideal of R. 


1658. Show that there is a unique ring homomorphism ¢: Z— R. The 
characteristic of R is n (positive or zero) if Ker @ = (n). Exhibit a ring with 
characteristic 7 = 0; 1,25... ; 
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165y. Prove that the characteristic of an integral domain is 0, 1, or a prime. 


1656. If char Rj =n and char R, =m, what is char (R, x R,)? (See 152e 
for the definition of R,; x R,.) 


165e. Determine the number of ring homomorphisms Z, > Z,, . 


166. Proposition. If 6: R>R' is a ring homomorphism and a’ is an ideal 
of R’, then a = $7 1a’ is an ideal of R. Furthermore, if a’ is a prime ideal, then 
a is also. If ¢ is an epimorphism and a’ is maximal, then a is also maximal. 


Proof. It is easy to see that a =~ ‘a’ is an additive subgroup of R. If 
aeaandreR, then 


(ra) = $(r)P(a) € 0’, 


and thus ra € a. This tells us that a is an ideal of R. If a’ is prime and abea, 
then 


f(ab) = $(a)G(b) € a’ 
and either 


gaea’ or bea’, 


which proves either ae a or b €a. Thus, a is prime. 

Suppose now that a’ is maximal, and that ¢ is onto. Ifa < b, thena’ c #b. 
Since ¢ is an epimorphism, @b is an ideal of R’ as the reader will easily verify. 
Since a’ is maximal, we have either #b = a’, in which case bc f-'b = 
¢@ ‘a’ =a and b =a, or else ¢b = R’. In this last case, b contains an element 
b such that @b = 1’. Then ¢(1 —b) =0 and 1 —beacb. Consequently, 
1 =(1 —b) + beb and b=R. We have shown that ac b implies b =a or 
b = R, and therefore a is maximal. 


Remark. In the last statement of the proposition we cannot remove the 
restriction that @ is an epimorphism. The inclusion mapping i: Z > Q gives 
an example: (0) is a maximal ideal of the field Q, but i~'(0) = (0) is not 
maximal in Z. 


166a. Prove that every ideal of the direct product R, x R, of two rings 
has the form a, x a, where a, and a, are ideals of R, and R,, respectively. 


166B. Let ¢ be a ring epimorphism from R to R’. Show that 
p ‘(a’ +b’) = (p1a’) + 1b’) 
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and that 
p *(a’b') = (p-*a')(p-'b’) 


where a’ and b’ are ideals of R’. 


167. Quotient Rings. Let a be an ideal of the ring R. We can define an 
equivalence relation on R by the rule: a= mod a if and only if a— bea. 
This is called congruence modulo the ideal a. (a=bmoda is read “a is 
congruent to b modulo a.”) Additively, a is a normal subgroup of R, and 
congruence modulo a is a special instance of congruence modulo a normal 
subgroup (37). This makes it unnecessary to verify that the properties (17) 
of an equivalence relation hold. 
The equivalence class of re R is denoted r + a; that is, 


rta={xeR|x—rea} 
={xe R|x=r+a,aeq}. 


The set of all equivalence classes of elements of R under congruence modulo 
a is denoted R/a since additively it is simply the quotient group of R by the 
normal subgroup a. It is clear that R/a is an abelian group with addition 
defined by the rule 


(a+a)+(b+a)=a+b+a. 
Furthermore, R/a is a ring in which multiplication is given by 
(a+ a)\(b+a)=ab+a. 


The reader should demonstrate for his own satisfaction that this multiplication 
is well defined and that, furnished with these operations, R/a satisfies the 
axioms of ring structure (150). 

We note that if R is the ring of integers Z, and a is the principal ideal (n), 
then R/a = Z/(n) is the ring Z, (153). 
1670. Let ¢: R>R’ be a ring epimorphism. Prove that R/(Ker @) is iso- 
morphic to R’. (Ker @ is defined in 165e.) 


1678. Show that the set of ideals of the quotient ring R/a is in one-to-one 
correspondence with the set of ideals of R containing a. 


167y. Let a and b be ideals of a ring R such that ac b. Show that the 
mapping ¢: R/a— R/b given by ¢d(a+a)=a+b is a well-defined ring 
epimorphism, and compute Ker @. 


1675. Let a denote the ideal of Z(i), consisting of all Gaussian integers 
a + bi such that a = 6 mod 2. Describe the quotient ring Z(i)/a. 
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167s. Let R'°:'! denote the ring of continuous functions on the closed unit 
interval [0, 1]. Let a denote the ideal of R'°:') consisting of all continuous 
functions f: [0,1] R such that f(1/2) =0. Describe the quotient ring 
Rl 1g, 


167¢. Leta and b be relatively prime ideals of a ring R, that is,a+b=R. 
Prove that 


R/(ab) = (R/a) x (R/b). 


168. Proposition. The quotient ring R/a is an integral domain if and only if 
a is a prime ideal. R/a is a field if and only if a is a maximal ideal. 


Proof. The mapping ¢: R— R/a which assigns to each element re R its 
equivalence class gr =r +a in R/a is a ring epimorphism. The ideal of R/a 
containing only 0 is the equivalence class of Oe R. In other words, in R/a, 
(0) = a, and further, 67 '(0) =a. 

If R/a is an integral domain, then (0) is a prime ideal of R/a (164) and 
a = ¢ 1(0) is a prime ideal of R (166). On the other hand, if a is a prime ideal, 
R/a cannot have zero divisors: 


(a+ a\b+a)=ab+a=a 


implies abea; hence, either ae€a or bea, which implies a+a=a or 
b+a=a. 

Similarly, if R/a is a field, then (0) is a maximal ideal of R/a (164) and 
a = ¢~1(0) is a maximal ideal of R (166). On the other hand, suppose a is 
maximal. An element a + a is zero in R/a if and only if aea. If a ¢a, then 
(a) + a = R because a is maximal. As a result, | € (a) + a, that is, 1 =a’a+ a" 
for some a’ € R, a” ea. Now we have 


(a +al(a+a)=aa+a=1+a. 


Clearly 1 + 4 is the identity element of R/a and (a+ a)"' =(a@' +a). Since 
every nonzero element of R/a has an inverse, R/a is a field. 
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169. A factorization r=ry,r,--+r, of an element r of a ring R is a proper 
factorization of r if each factor r; is a proper element (not a unit or zero) of R. 
A factorization having units or zero among the factors is called improper. 
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Every element r of a ring has an improper factorization r = Ir, but not 
every element has a proper factorization. An element of a ring which has no 
proper factorization is called a prime. A proper element which is prime is 
called a proper prime. Clearly the unity element | is always a prime (improper), 
but the zero element 0 is a prime (improper) if and only if the ring is an 
integral domain. (Why ?) 

For example, the prime elements of Z are 0, +1, and +p, where peN is 
prime in the ordinary sense (22). The primes in the polynomial ring F[x] 
over a field F are the constant polynomials (improper) and the irreducible 
polynomials (proper). 

Two elements of a ring are associates if each one is a multiple of the other 
by a unit. It is not difficult to see that association in this sense is an equivalence 
relation. 


169a. Prove that two elements of an integral domain are associates if and 
only if they generate the same principal ideal. 


1698. Prove that any associate of a prime is a prime. 


169y. Which of the numbers 3, 5, 7, 11, 13, 17, 19 are prime in the ring 


Z(,/2) = {xe R|x=a+b./2, a, be Z}? 


170. If aand b are elements of a ring R, we say that a divides b (written a |b) 
if b = ra for some re R. The set of all elements of R divisible by ae R is just 
the principal ideal (a). Furthermore, a|b if and only if (6) < (a). 

An element de R is a greatest common divisor of elements a and b of R 
provided 


(a) dl|aandd|b, 
(b) ce Randcla,c|b imply c|d. 


Generally, there is not a unique greatest common divisor: if d is a greatest 
common divisor, then so is d’ = ud, where u is a unit. In an integral domain R, 
any two greatest common divisors d and d’ of a and b are associates. (We 
must have d|d’ and d’|d so that d’=rd and d=r'd' =r'rd which implies 
r'r = 1 when d # 0; d=0 implies d’ = rd = 0.) 


Proposition. Ina principal ideal domain R, an element d is a greatest common 
divisor of two elements a and b if and only if 


(d) = (@) + (®). 


Proof. Since d|a and d|b imply (a)<(d) and (b)c(d), we have 
(a) + (b) < (d) when d is any common divisor of a and b (160). Since R is a 
principal ideal domain, (a) + (6) =(c) for some ce R and (c) < (d), implies 
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d|c. On the other hand, (a) < (c) and (6) < () imply c|a and c|b. Therefore, 
c|dand (d) <(c) if dis a greatest common divisor of a and 3, and it follows 
that (d) = (c) = (a) + (0). 

Suppose (d) = (a) + (b). Then (a) <(d) and (6) c(@), from which we 
conclude that d|a and d|b. If c|a and c|b, then as before (a) < (c) and 


(b) <(c) so that (d) = (a) + (6) < (c), and therefore c|d. Thus, d is a greatest 
common divisor of a and b. 


It is clear from this proposition that in a principal ideal domain every pair 
of elements has a greatest common divisor. The following corollary is another 
immediate consequence of this proposition. 


Corollary. If anelement d of a principal ideal domain R is a greatest common 
divisor of elements a,be R, then there exist elements r,r’' €.R such that 
d=ratr'b. 


Corollary. If p is a prime element of a principal ideal domain R, then p|ab 
implies p|a or p|b. 


Proof. Suppose p|ab and pja. Then | is a greatest common divisor of 
p and a. (Why?) By the preceding corollary, 


l=rp+ra 
for some r,r’é R. Then b = brp + r‘ab is divisible by p. (This is essentially 
the same proof as given in 23 and 103.) 
170e. Anelement m of aring R is a least common multiple of two elements 
a, be Rif and only if 


(1) a|m and b|m, 
(2) a|c and b|c for any element c € R, then m|c. 


Show that m is a least common multiple of a and b in R if and only if 


(m) = @) 1 (2). 


1708. Let d be a greatest common divisor and m a least common multiple 
of elements a and b of a domain R. Show that dm and ab are associates. 


171. A unique factorization ring is a ring in which the following conditions 
hold: 


(1) every proper element is a product of proper primes (not necessarily 
distinct); 
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(2) two factorizations of the same proper element as a product of proper 
primes have the same number of factors; 

(3) if r =p; P2*** Per =9192°°*9, are two factorizations of a proper ele- 
ment r into a product of proper primes, then there exists a permutation 
of k letters, ne S, such that p; and q,,;) are associate primes for 
Be ee est Kin 


It is not possible to require more than this. Given a product of proper 
primes p,p,°**p,, we can select units u,,u2,..., 4, so that their product 
u,u, *** u, = 1, and then for any 7 € S, we have p,p2 °** Py = 4192 °** 9, where 
9n(i) = 4; p;- We note that even in the ring of integers we have twelve factoriza- 
tions of 12: 


Pees — (2-3) =(— 2)" 2"(— 3) — (= 2)(—2) +3, 
12 = (2-3-2 =2-(—3)(—2) =(—2) -3 -(—2) = (—2(—3) - 2, 
ere ao) 2) (3) - 2 -(—2) =(— 3X2) «2. 


We shall be mainly interested in unique factorization domains, that is to say, 
integral domains which satisfy (1), (2), and (3). We observe, however, that for 
p prime, Z,, is a unique factorization ring which is not an integral domain. 


171a. Prove that Z,, is a unique factorization ring (p prime). 


171B. Show that a quotient ring of a unique factorization ring need not be 
a unique factorization ring. 


172. Theorem. A euclidean domain is a unique factorization domain. 


Proof. First we remark that an element x in a euclidean domain R with 
norm 6: R* +N is a unit if and only if d(x) = 1. Clearly 6(a) = d(la) = 
6(1)d(a) implies 6(1) = 1, from which it follows that a unit uw has norm 1, 
because 


1 = d(uu~') = d(u)d(u7'). 


Now suppose 6(x) = 1. Then we have | = qx + r wherer = 0, since d(r) < d(x) 
is impossible. Therefore, | = gx and x is a unit. 

Next we show by induction on norms that every element re R with 
6(r) > 1 is a product of proper primes. Clearly, 6(r) =2 implies r is prime. 
Suppose that | < 6(r) < n implies that r is a product of proper primes, and let 
O(r) =n. If r itself is a prime, then it is a proper prime and we are finished. If 
r is not a prime, then it has a proper factorization r = ab. Then 6(r) = 6(a)6(d), 
from which we conclude that | < 6(a) < nand | < 6(b) <n. By the induction 
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hypothesis it follows that a = p,p, °°: p, and b =q,q2°--q, where the p,’s 
and q,’s are proper primes. Finally, we have 


r= ab =pypr°** Pe 192°" MN; 


and the induction step is complete. 

It remains to show that factorization is unique in the sense of 171. We 
proceed as follows. To every proper element r € R we assign a natural number 
I(r) called the length of r, by taking I(r) to be the minimum number of proper 
primes occurring in any factorization of r as a product of proper primes. 
Therefore every prime factorization of r has at least I(r) factors, and there is 
at least one prime factorization with exactly I(r) factors. Our proof is by 
induction on I(r). 

If I(r) = 1, then r is prime and cannot have a proper factorization. Thus, 
r=D,P2°** DP, implies k =1 and r, = p,. As the induction hypothesis we 
assume that for I(r) =k, any two factorizations of r are equivalent in the 
sense of 171. Suppose now that I(r) =k+1 and r=p,p.°-: Py4, iS a 
minimum factorization of r as a product of primes. Let r=q,q,°°:q,, be 
another factorization. Since p,+,|r, it follows that p,,,|9; for some j. 
(Why?) Then q; = up,41, where u must be a unit since q; is prime. Now we 
divide p,,, out of both factorizations, obtaining 


r' = DiP2*** Pe= N92 °°" Gj “Gaadeyae "Ams 


where r’p,4, =r. Clearly I(r’) =k, and these two factorizations of r’ are 
equivalent by the induction hypothesis. It follows that the two factorizations 


r= PyP2°°* Pr=%192°°° Im 


were equivalent. 


Corollary. The rings Z, Z(i), and Z(w) where w = e?"!? are unique factoriza- 
tion domains. (See 159 and 159.) 


Corollary. If F is a field, then the polynomial ring F [x] is a unique factorization 
domain. (This is the theorem of 104.) 


173. The set %p of ideals of a ring R may be considered as an algebraic 
structure under the product of ideals defined in 161. This product is associative 
and has an identity, namely the ideal (1) = R. (An algebraic structure of this 
type is called a monoid.) ay 

Let ¢: R> %p denote the mapping which assigns to each element ae R 
the principal ideal (a) € 4%. Ring multiplication makes R a monoid, and ¢@ 
may be viewed as a “homomorphism of monoids,” that is to say, 
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(ab) = (¢a)(pb). To see this, we note that ab e(a)(b), which shows that 
(ab) < (a)(b); on the other hand, (ab) contains all the products of the form 
a'b’ where a’ €(a), b’ €(b), which shows that (a)(b) < (ab). Thus, (ab) = 
(ab) = (a)(6). 

Obviously R is a principal ideal domain if and only if the mapping @ is 
onto. Furthermore, two elements a, b € R are associates if and only if they 
have the same image under @, that is, (a) =(b). (See 169«). The group of 
units of R is just U= ~‘(1). An element ae R is proper if and only if (a) 
is a proper ideal. 


Lemma. An element pe R, a principal ideal domain, is a prime if and only if 
the ideal dp = (p) is a prime ideal. 


Proof. Suppose p is prime. If abe (p), then ab = rp for some re R, and 
p|ab. By the second corollary of 170, we have p|a or p|b, from which we 
conclude that ae (p) or be (p). Thus (p) is prime. 

On the other hand, suppose (p) is a prime ideal. If p = ab, then abe (p). 
Consequently, ae (p) or b € (p). However, a € (p) implies a = pr = abr, from 
which it follows that br = 1 and that 6 is a unit. Similarly, b € (p) implies 
that ais a unit. We see that p has only improper factorizations and is therefore 
prime. 


Theorem. Every proper ideal of a euclidean domain can be factored uniquely 
(except for the order of the factors) as a product of proper prime ideals. 


Proof. Let R be a euclidean domain. If a = p,p,--- p, is a factorization 
of the proper element ae R as a product of proper primes, then (a) = 
(p,)(P2) +: (py) is a factorization of the ideal (a) € %p as a product of prime 
ideals. Thus the unique factorization theorem for elements (172) implies 
factorization of ideals. Suppose now that (a) = (q;)(q2) °** (q,) is a second 
factorization of the proper ideal (a) as a product of proper prime ideals of R. 
Since (a) = (9:92 °°: 4,), the elements a and q,q, -:: q, are associates. In other 
words, a=uq,g,°°'q, for some unit we R. The ideals (uq,) =(q,) and 
(42), (93), ---, (g,) are prime, and by the lemma above, the elements uq,, 92, 

.., 4, are primes. As a result 


a = PyPo°** Pe = (UGs)92 °° M1 


and we have two factorizations of a as a product of primes. Now it follows 
from 172 that / =k and that there is a permutation of k letters, 7 € S,, such 
that g; and p,,;) are associates. This implies that (q;) = (p,y;)) fori = 1, 2,..., k. 
Clearly the factorizations 


(a) = (py)(P2) °° * (Pa) = (9192) *** A) 


are the same except for the order of the factors, which completes the proof. 
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This unique factorization theorem for ideals is a more natural result than 
the unique factorization for elements preceding it—first, because it is much 
easier to say what uniqueness means, and second, because there is a very large 
class of rings which have unique factorization of ideals but not of elements. 
(An example of such a ring is Lig) 5): which is discussed in the exercises 
below.) In other words, we can generalize the theorem just proved to rings 
which are not euclidean nor even principal ideal domains. This involves a 
more thorough study of ideals than we have made and will be carried out in 
the next chapter. 


173a. Prove that in a principal ideal domain every proper prime ideal is 
maximal. 


173B. Let Z(./—5) denote the set of complex numbers of the form 


a + b,/ —5 with a, be Z. Show that Z(./ —5) is a ring under the usual addi- 
tion and multiplication of complex numbers. 


173y. Fora+b,./—5eZ(./—5S) define the norm of a+ bs / = to be 


N(a + b./—5) = |a+ b./—5| =a? + 5b? 


so that N(aB) = (Na)(NB) for all «, Be ZY 2 Using this norm, determine 
the units of Tig =) and show that 2, 3, and 1 + y= are proper primes in 
Z./=5). (Then 6 has two inequivalent factorizations, 6=2-3 and 
6=(1 + J/-5(1 - as): so that Z(./—5) is not a unique factorization 
ring.) 


1735. Let p denote the set of elements of ZC) 5) of the form a + 5 
with a = b mod 2. Show that p is a maximal prime ideal of Xf —5). Show 
further that p = (2, 1 + eS) and that p? = (2). (p? is just the product pp.) 
Show that p is not a principal ideal. 


174. As an illustration of the preceding articles, we examine in detail 
factorization in the domain of Gaussian integers, Z(i). We recall that Z(i) 
consists of all complex numbers a + bi where a and b are ordinary integers. 
The units of Z(i) are +i and +1. Now Z(i) is a euclidean domain (159), and 
consequently a principal ideal domain and a unique factorization domain 
(172). 

The primes of Z(i) are called Gaussian primes. We observe that 
z=a+ bie Zi) is a Gaussian prime if and only if its complex conjugate 
Z =a-— bi is also a Gaussian prime. Indeed, z has a proper factorization 
z = uv if and only if Z has the conjugate factorization Z = uv, which is also 
proper. Of course it may happen that a = 0 or b = 0 in which case z = + 2, 
and z and 2 are associated primes. 
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Suppose z = a + bi is a proper Gaussian prime and a #0 ¥ BD. In this case 
we claim that zz =a* +? is a prime in the ordinary sense. If not, then 
a’ + b? = mn where m,neéN and m #1 ¥n. Furthermore, we may assume 
that z and Z are relatively prime: otherwise they must be associates, which 
implies that a= +b = +1 and a’ + b? =2; in this case we are finished since 
2 is prime. Since z is prime and z|(a* + b”), we must have z|m or z|n (170). 
However, z|m implies m = zw and, since m is real, m = m = ZW, so that 
Z|m. Furthermore, z and 2 are relatively prime, and therefore z|m and Z|m 
imply zZ|m and m = a? + b?, n=1. The assumption z|n leads likewise to 
the conclusion n = a? + b?, m = 1. These are contradictions and a” + b? is 
prime. 

On the other hand, if z = a + bi is a Gaussian integer with a? + b? a proper 
prime of Z, then z is a Gaussian prime. This is easy to show: z = wv implies 
Z = ud, from which it follows that 


a? + b? = zz = (uv)(iid) = (ui) (vd), 


which gives a factorization in Z of a? + b*, which was assumed prime. To 
sum up, we have seen that a Gaussian integer a + bi witha # 0 # b is a(proper) 
Gaussian prime if and only if a* + b? is a (proper) prime of Z. 

Clearly other proper primes of Z(i) must have the form +p or +ip where p 
is a prime in the ordinary sense and p 4 a? + b? for any integers a and b. 
These primes are characterized by the following result. 


Theorem. A positive proper prime p€ Z is a proper Gaussian prime if and 
only if p = 3 mod 4. 


Proof. Let p be a positive proper prime of Z. We make use of the ring 
Z,(i) of elements a + bi where a,be Z, with addition and multiplication 
given by 


(a+ bi)+(c+di)=(at+c)+(b6+4)i 
and 
(a + bi)(c + di) = (ac — bd) + (ad + be)i. 
A ring epimorphism ¢: Z(i) > Z,(i) is given by 
(a + bi) = [a], + [6],i, 


where [x], denotes the congruence class of x modulo p. Obviously Ker ¢ = (p), 
the principal ideal of Z(i) generated by p. As a result, Z,(i) is isomorphic 
to the quotient ring Z(i)/(p). (See 167a.) Likewise there is a ring epimor- 
phism y: Z,[x] > Z,(i) given by (/) =/(i) for any polynomial f over Z,. 
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The kernel of y is easily seen to be (x? + 1), the principal ideal of Z,[x] gen- 
erated by x7 + 1. Consequently, Z,(i) is isomorphic to the quotient ring 
Z,[x](x? + 1). We can summarize what we have proved so far in the state- 
ment: 


Z(i)/(p) © Z,(i) © Z,[x](x* + 1). 


From this we see that the eight statements below are equivalent: 


(1) pis prime in Z(i); 

(2) (p) is a maximal prime ideal of Z(i); 

(3) Z(i)/(p) is a field; 

(4) Z,(i) is a field; 

(5) Z,[x]/(x? + 1) is a field; 

(6) (x? + 1) is a maximal prime ideal of Z,[x]; 
(7) x? +1 is irreducible over Z,; 

(8) x? +1 has no root in Z,. 


Thus, the question of primeness of p in Z(i) reduces to whether x” + 1 has a 
root in Z, or not. In Z, we have +1 = —1 and x? + 1 has the root 1. If p 
is an odd prime, then a root « of x? + 1 over Z, satisfies a7 = —1 and a* = 1. 
Therefore a is an element of order 4 in the multiplicative group Z, of Z,. 
Conversely, an element of order 4 in Z,, is a root of x? + 1. (Why?) However, 
Z,, is a cyclic group (100) and therefore has an element of order 4 if and only 
if 4 divides o(Z;) = p — 1. Thus, we see that x? + 1 has a root in Z, if and 
only if p =2 or p= 1 mod 4. Finally, we have that p is prime in Z(i) when 
p#2 and p#1 mod 4, which is to say, when p = 3 mod 4. 


Corollary (Fermat). Every prime p of the form 4m + | can be written uniquely 
as the sum of two squares. 


Proof. Since p is prime in Z, but not in Z(i), there is a Gaussian prime 
a+ bi with a#0#b5, which divides p. Then a — bi also divides p and 
a’ + b* =(a + bi)(a — bi) divides p*. However, a? + b? is prime, therefore 
a? + b* = p. Uniqueness follows from unique factorization in Z(i). (Why?) 

Now we have completely determined all the primes of Z(i). The improper 
primes are 0, +1, and +7. The proper primes are of the form +p, +ip, where 
péNisa prime of the form 4m + 3, and of the form a + bi where a? + b? is 
prime. 


174. Determine all the primes of Z(i) with absolute value 5 or less. 
174B. Factor as a product of primes in Z(i) the numbers 15 and 6 + 81. 


174y. Determine completely the natural numbers that can be written as the 
sum of two squares. 
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175. Fermat’s Last Theorem. Pierre de Fermat (1601-1665), whom LaPlace 
characterized as “‘the true inventor of the differential calculus,” discovered 
many theorems in number theory as well. His interest in the subject was 
aroused by the appearance in 1621 of Claude Bachet’s edition of the Arith- 
metica of Diophantus. Beside the eighth proposition of the second book 
(‘To divide a square number into two other square numbers”’), Fermat in 
1637 made the following scholium: 

“To divide a cube into two cubes, or a fourth power into two fourth 
powers, and generally any power whatever beyond the second into two of the 
same denomination, is impossible. Of this fact I have discovered a very 
wonderful demonstration. This narrow margin would not take it.”’ 

This statement, that for n > 2 the equation x” + y” = z” has no solutions 
in which x, y, and z are natural numbers, is called Fermat’s last theorem or 
Fermat’s great theorem (as opposed to Fermat’s “‘ little”? theorem given in 42). 
Despite the strenuous efforts of many eminent mathematicians, among them 
Euler, Legendre, Abel, Gauss, Dirichlet, Cauchy, Kummer, Kronecker, and 
Hilbert, no general proof has been attained. It seems likely that Fermat was 
mistaken in believing he had a proof. 

In the attempt to prove Fermat’s last theorem, much valuable mathematics 
has developed. The classical ideal theory, which forms the subject of the next 
chapter, is one result. Our interest in Fermat’s last theorem at this point is 
due to its connection with unique factorization in the Kummer rings Z(p) 
defined in 155. 

For a long time it was thought that the ordinary laws of arithmetic, such 
as the division algorithm and unique factorization, must extend tothe domains 
Z(p), where p = e2*'/? and p is prime. Gabriel Lamé (1795-1870) gave a proof 
of Fermat’s last theorem assuming unique factorization in Z(p), in the year 
1847. The error in Lamé’s proof was observed by Joseph Liouville (1809-1882) 
and by Kummer. Cauchy, also in 1847, gave a false proof that Z(p) is a 
euclidean domain. (The first prime for which this fails is 23.) In the years 1844 
to 1851 Kummer developed a theory of unique factorization for these rings. 
Kummer’s theory was eventually superseded by the theory of ideals developed 
by Dedekind and Kronecker, but he did succeed in proving Fermat’s last 
theorem for a large class of exponents. 

We shall consider here only the first case of Fermat’s last theorem, the 
equation x° + y? = z°. Evidently Fermat knew a proof of this case, but the 
first published proof is that of Euler’s Elements of Algebra, Chapter XV, 
Volume II (second edition, 1774). A much simpler proof, due to Gauss, is 
based upon the arithmetic of Z(w) where w = e?*"/3. This is the proof we 
shall give. 

Z(w) is the set of all complex numbers of the form a + bw where a, b € Z. 
Note that w? = ®@ = —1 — aw. Now Z(q) is a euclidean domain (159a) and 
therefore a principal ideal domain and a unique factorization domain. The 
norm of z=a + ba is 
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2Z = (a+ bw)(a + bw”) = a* — ab + b?. 


Since a unit must have norm 1, it follows that Z(w) has only the six units 
+1, +, and +’. By arguments entirely similar to those for Z(i) in 174, 
we obtain the following: an element a+ bwe Z(w) with a#0Fb is a 
(proper) prime of Z(w) if and only if its norm a* — ab + b? is a (proper) prime 
of Z. The other primes of Z(w) have the form +p, +pw, or + pw? where 
pé Zis a proper prime and p ¥ a” — ab + b? for any integers a and b. These 
are characterized by the following result. 


Theorem. A positive proper prime p € Z is a proper prime of Z(w) if and only 
if p= 5 mod 6 or p=2. 


Proof. We construct the ring Z,(w) of elements a + bw where a,be Z, 
by defining addition and multiplication as follows: 


(a+ bw) + (c + dw) =(a+c)+(b+d)w 


and 

(a + bw)(c + dw) = (ac — bd) + (ad + be — bd)w. 
It follows (as in 174) that we have ring isomorphisms: 

Z(w)(p) © Z,(w) = Zp[x](x* + x + 1). 


Thus the question of primeness of p € Z(w) is equivalent to irreducibility of 
x? +x+1 over Z,. Over Z,, x7 +x+1 is irreducible because neither 
element of Z, can be a root. Now suppose p is an odd prime and x? +x + 1 
has a root «e Z,. Clearly « # 0 and 1/a is also a root, since the product of 
the two roots of x? + x + 1 is 1. Since the sum of the roots is —1, we have 
a +(I/a) = —1. Squaring yields «? +2 +(l/a?)=1, or a* +0741 =0, 
which shows that «? is also a root. The polynomial x* + x + 1 can have at 
most two roots in Z,, therefore a? =a or a? = I/a. If a? =a, then a =1 
and 1+1+1=0OinZ,, implying that p = 3. If a? = 1/a, then a? = 1 and « 
is an element of order 3 in the cyclic group Z, of order p — 1. As a result, 
3|(p — 1). Since p is odd, 2 |(p — 1), and therefore 6 |(p — 1). To summarize, 
x? + x +1 is reducible over Z,, or in other words x? +x-+4+1 has a root in 
Z,, precisely when p = 3 or p= 1 mod 6. The theorem follows from the fact 
that p prime, p # 3, and p# 1 mod 6 imply p = 2 or p = 5 mod 6. (Why?) 


Theorem. The equation x? + y® =z? has no solution in natural numbers. 


Proof. We shall actually prove more: the equation x? + y? = z? has no 
solutions in Z(w) except the trivial ones in which x, y, or z is zero. Suppose 
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x, y, and z are three nonzero elements of Z(w) such that x* + y® = z°. Clearly 
we may assume that x, y, and z are relatively prime in pairs, which is to say 
that in terms of ideals, 


(x,y) = (2) = (2, x) = (1) = Z@). (1) 


Indeed, if two of the three quantities x,y, and z had a greatest common 

divisor d which was not a unit, then the whole equation would be divisible by 

d°, and (x/d), (y/d), (z/d) would be a solution for which the assumption held. 
We note that 1 — wm has norm 


(1 — @)(1 — @) = (1 — @)(1 — @’) = 3, 
and is therefore prime in Z(w). Furthermore, its complex conjugate is 
l1—o=l1 —- ow’ =o -—o? =o — = -—w(1 —). 


Since —w? is a unit, 1 —w and 1 —@ are associates. Therefore if 1 —w 
divides a € Z(w), then 1 — w also divides &, the complex conjugate of a. We 
observe that every element of Z(w) is congruent to 0, 1, or 2 modulo (1 — w): 
given a + bwe Zw), we have a+ b = 3g + r where 0 <r <3 and 


at+bw=a+b=3q+r=rmod(l —«@). 


(Of course, 2 = —1 mod(l — w) since 1 — w divides 3.) 
Finally, we remark that « = + 1 mod(l — w)* implies that «? = + 1 mod 
(1 — w)**>. To see this, we write « = + 1 + B(1 — w)* and then 


a Fl=(«eF l(a Ft w)(a Fw?) 
= B(1 — w)*(BU — w)* + 1 F @)(BC — w)* + 1 Fo’) 
= (1 — w)'*7B(B + 1)(8 + (1 + @)). 


One of the three quantities B, B +1, and B+(1+@) must be divisible by 
1 — w. (Why?) Therefore a? $ | is divisible by (1 — w)**?. 


The arithmetic of Z(w) is relevant to the equation x° + y* = z? precisely 
because within Z(w) we have the factorization 


x? + yi =(x + y)(x@ + yw*)(xw” + yo). (2) 


The three factors x+y, xw +yw*, and xw*?+yw are all congruent 
mod(1 — w) and their sum is zero since | + w + w* = 0. In addition, taken 
in pairs they are either relatively prime or else have 1 — w as greatest common 
divisor as we see from the equations 
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(v — uw)(x + y) + w2(v — u)(xw + yo’), 
1—w= ((vw — u)(xw + yw”) + (uw — v)(xw* + yo), 
w*(v — u)(xw? + yw) + (u — vw)(x + y), 


where ux + vy = 1. (Recall that we are assuming (x, y) = 1.) 


One of the three quantities x,y, and z must be divisible by the prime 
1 —q@. Otherwise we have x, y, and z congruent to +1 mod(l — q), from 
which it follows that x3, y>, and z* are congruent to +1 mod(l — w)*. Then 
x3 + y? =z? implies 


+14+1= +1 mod(l —o)*, 
which leads to the impossible congruences 
0= +1 mod(l—q@)* and +2= +1 mod(l —o)*. 


Furthermore, we may always change notation to insure that it is z which is 
divisible by 1 — w, since x? + y? = z? is equivalent to the equations 


x3 +(-wz)?=(-wy)? and = y?+(—wz)? =(—ox)?. 


Among all the nontrivial solutions of x* + y? =z? in Z(w) for which 
1 — mw divides z, there must be one for which | — w divides z as few times as 
possible. (Why?) We may summarize our progress thus far in the following 
statement. If the equation x* + y? =z? has a nontrivial solution in Z(w), then 
it must have a nontrivial solution such that 


(1) x, y, z are relatively prime in pairs; 

(2) 1 —@ divides z; 

(3) there exists a number 4€N such that (1 — w)* divides z but (1 — w)**! 
does not; 

(4) in any other solution with properties (1) and (2), (1 — w) divides z. 


We shall show that given any such minimal solution, we can find one in which 
z is only divisible by (1 — w)*~', contradicting (4). 


Let x, y, and z bea nontrivial solution of x? + y* = z? satisfying conditions 
(1)-(4) above. Then z = 0 mod(1 — @) implies that 


x+y=xw + yw? = xw* + yw = 0 mod(1 — o), 


and we have x + y=(l—@)A, xw+ yo? =(1 —@)B, and xw? +yo= 
(1 —w)C where A, B, Ce Z(w) and A + B+C=0. Furthermore, since 


(x + y, xw + yw*) =(xw + yo”, xw? + yw) = (x? + yo, x + y) =(1 —@), 
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it follows that 
(A, B) = (B, C)=(C, A) = (I). 


In other words, the quantities A,B, and C are relatively prime in pairs. 
Moreover we have (z/(1 — w))? = ABC, and unique factorization in Z(w) 
implies that A, B, and C are associates of cubes, that is, 


Atay B= pir, "Cage. 


where a, B, and y are units of Z(w) and ¢, n, and @ are relatively prime in 
pairs. We have that o«By = (z/(1 — w)tn@)> is both a unit and a cube in Z(a). 
Since the units of Z(w) are just the sixth roots of unity, it follows that 
apy = +1. 

We know that | — w divides x + y, but not x or y. Therefore 


x = +1 mod(l -—o), = —xmod(l — ), 
and 
x*=+1modl-—o)*, y?=—x* mod(l —o)*, 
so that 
z>=x>?+ y?=0mod(l — o)*. 


Consequently, (1 — w)* divides z*> = (1 — w)° ABC, and (1 —q@) divides one 
and, since they are relatively prime, only one of the three quantities A, B, and 
C. This in turn shows that | — qm divides just one of the elements ¢, n, 0 € Z(w). 
Without loss of generality we may assume that | — w divides 0. 

Since 1 —q@ divides neither ¢ nor n, we have (>? and n° congruent to 
+1 mod(I — w)*. As a result A + B + C =0 implies 


al? + Bn? + yO? = +0+ 8 =0 mod(Il — o)?. 


Because « and fB are units, this is enough to show a= +f. This yields 
apy = ta?y = +1 = +a2 and y = +a. Therefore, eliminating «, B, and y, we 
obtain from A + B + C = 0 an equation of the form 


e,0° + €,9° + 830° =0, 
where e; = +1. Then setting x9 =&,C, yo = €&2n, and Z) = —e€30, we have 
x9 + Yo = 20. 


Furthermore, Zo is divisible by (1 — w)*~! at the most, since z> = (1 — @)?° 
(al*)(Bn?)(y0*) is divisible by (1 — w)*4 and 1—q@ divides 0 only. Thus we 
have arrived at a contradiction. 
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Chapter 6 


In this chapter we study those integral domains which have a unique 
factorization theorem for ideals. Such rings are called Dedekind domains, 
and their study is called classical ideal theory. We define a Dedekind domain 
to be an integral domain whose ideals have a certain property (invertibility in 
the field of fractions) and then prove that this is equivalent to unique factoriza- 
tion of ideals. Finally, we apply this theory to prove that the ring of integers in 
a Galois extension of the rational field Q is a Dedekind domain. From this we 
draw the conclusion that the Kummer rings are Dedekind domains. This 
re-establishes a form of unique factorization for the rings associated with 
Fermat’s last theorem, where the problem of unique factorization first became 
critical. 


Fields of Fractions 


176. Let R be a nontrivial integral domain. We shall construct a field Qz 
containing R as a subring by adding to R all the fractions r/s where r € R and 
se R* = R — {0}. Naturally Q, is called the field of fractions of R. The ele- 


174 
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ments of R are identified with the fractions having denominator 1. The general 
construction of the field of fractions Qz out of R is an exact parallel of the 
construction of the field of rational numbers Q out of the ring of integers Z. 
It is helpful to keep this in mind. 

We define an equivalence relation on the set R x R* by the rule (r,, 5;) ~ 
(r,, 52) if and only if r,s, =r,5,. (The reader should verify that ~ is an 
equivalence relation.) The equivalence class of (7, s) will be denoted r/s. 
Clearly r,/s, = r,/s, if and only if r,s, = r,s5,. We let Qp denote the set of all 
such equivalence classes. Addition and multiplication in Qp are defined by 


(2) (2) = (ee 
Sy 52 5152 
Clete 
51] \S2 r 5152 


It is necessary to verify that these operations are well defined, but we omit 
doing it. Qp is a field under these operations. The additive identity element 0 
may be represented as 0/s for any s € R*. The multiplicative identity element 1 
may be represented as s/s for any se R*. The additive inverse of r/s is —r/s 
and, if r #0, the multiplicative inverse is s/r. 

The mapping ¢: R >Q, given by ¢r = r/1 is easily seen to be a ring mono- 
morphism. This enables us to identify the element re R with the element 
r/1 € Qz and to think of R as a subring of Qp. 

We observe that for the ring of polynomials F [x] over a field F, the field of 
fractions Q,,, is the field F(x) of rational functions over F. (See 98a.) 


and 


176a. Let R bea nontrivial integral domain and F a field. Show that a ring 
monomorphism ¢: R— F can be extended uniquely to a field monomorphism 
d: QroF. 

176B. Let S be a subset of a ring R such that (1) S contains no zero divisors 
of R (hence 0¢ S) and (2) a,beS implies abe S. Construct a ring Ry of 
fractions r/s where the denominators s are elements of S, by imitating the 
field of fractions construction. 


176y. Show that any ring R is a subring of some ring with the property that 
every element is a unit or a zero divisor. 


1768. Show that the field of fractions of Z(,/— 5) is Q(,/—5). 


176e. Show that the field of fractions of the Kummer ring Z(p) is Q(p). 
(See 155.) 


177. The field of fractions Qz of a nontrivial integral domain R, being a 
field, has only two ideals, (0) and (1). We can, however, introduce a certain 
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class of subsets of Qp, called fractionary ideals of R, whose formal behavior 
resembles that of ideals of R. Fractionary ideals are important in the study of 
Dedekind domains. 

Let R be an integral domain with | # 0, that is, nontrivial. A fractionary 
ideal of R is an additive subgroup a of Qp, such that 


(1) ae€a implies rae a for all re R, 
(2) there is some r € R* such that rae R for all aea. 


Every ordinary ideal of R is a fractionary ideal of R. For clarity and emphasis, 
we call an ordinary ideal of R an integral ideal. Obviously a fractionary ideal 1s 
integral if and only if it is a subset of R. 

With every element r/s € Qpz we may associate the principal fractionary ideal 
(r/s) defined by 


, 


le rr 
(-) =|xeQulx=Z ren}, 
S S 


In other words, (r/s) contains all the multiples of r/s by elements of R. 

The notions of sum and product of ideals may be extended to fractionary 
ideals in the obvious fashion. Thus, if a and b are fractionary ideals of R, then 
a + b is the fractionary ideal containing all elements of the form a + b where 
aeaand be b. Furthermore, ab is the smallest fractionary ideal containing 
all products ab where ae a and be b. Of course if a and b are integral ideals, 
then so are a + b and ab. 


177a. Which of the properties given in exercises 160a, 1608, 1610, 1618, 
161y, and 1616 hold for fractionary ideals? 


177B. Compute the following for fractionary ideals of Z: 


(a) (1/2) + (2/3), 

(b) (1/2)(2/3), 

(c) (1/2)((2/3) + (3/4), 
(d) (1/2) m (2/3), 

(e) (1/2) 9 (2/3)(3/4). 


178. Ifaand 6b are fractionary ideals of R and b ¥ (0), then we define their 
quotient to be the set 


a/b = {x €Qpz|(x)b < a}. 


To see that a/b is again a fractionary ideal of R, we observe that it is clearly 
an additive subgroup of Qz which is closed under multiplication by elements 
of R, and that it has the special property of fractionary ideals. Specifically, if 
re R* is anelement such that rae R for alla €aand bis a nonzero element of 
b, then rb € R* is an.element such that (rb)x € R for all x € a/b. (Why?) 
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Note that even when a and Db are integral ideals, their quotient a/b need not 
be an integral ideal. Thus, the fractionary quotient a/b is formally a new 
operation; it is related to the integral quotient a: b by the equation a: b = 
(a/b) 0 R when a and Bb are integral. (See 161.) 


Proposition. Every fractionary ideal is the quotient of an integral ideal by a 
principal integral ideal. 


Proof. Ifa isa fractionary ideal of R, there exists an element r € R* such 
that rae R for all aea, or equivalently, such that (r)a is an integral ideal. 
We shall see that a = (r)a/(r). If x Ea, then (r)(x) c (r)a, and consequently, 
x €(r)a/(r). On the other hand, if x €(r)a/(r), then rx € (r)a, or equivalently, 
rx = sra for some sé R and aeaa. Since R is an integral domain and r #0, 
this implies x = saeéa. 


Corollary. If Ris a principal ideal domain where | # 0, then every fractionary 
ideal of R is principal. 


Proof. By the proposition, we know that every fractionary ideal of R has 
the form (r)/(s) for r, sé .R, s # 0. It suffices to observe that (r)/(s) = (r/s). 


178a. Prove the following properties of the quotient operation for fraction- 
ary ideals: 


(a) (a/b)b ca, 

(b) (a/b)/¢ = a/be, 

(c) a/(b + ¢) = (a/b) 4 (a/c), 

(d) (an b)/e = (a/c) 7 (6/c), 

(ec) Rea/b if and only if bea. 


178B. Compute the quotient a/b of the fractionary ideals a and b of the ring 
R in the following cases: 


(a) R=Z, a =(1/2), b = (2/3), 
(b) R=Z(i), a=(1/1 + 3), b= (7/2), 
(c) R= Q[x], a = (x/x + 1), b = (1/x). 


179. A fractionary ideal a of a nontrivial integral domain R is invertible if 
there exists a fractionary ideal b of R such that 


ab= (=A. 


If ab =(1), then clearly bc(l)/a. On the other hand, x e(1)/a implies 
x E(x) =(x)ab cb since (x)ac (1). Thus ab =(1) implies b = (1)/a. It is 
convenient to write a ' in place of (1)/a when a is invertible. 
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We note that a nonzero principal fractionary ideal (a) is always invertible 
and that (a)~ 1 = (a~'), or in other words, (1)/(a) = (1/a). In general there are 
invertible fractionary ideals which are not principal, but we can prove the 
following. 


Proposition. An invertible fractionary ideal is finitely generated. 


Proof. Let a be an invertible fractionary ideal of R. Since 1 € (1) = a7!a, 
we have 


, 


1 =a\a, + a,a,+-++-+a,4, 
for some elements aj, a,,...,a,,¢a ' and a,,a,,...,a,€a. If xea, then 


the elements r, = xa}, r. = xa,,...,7, = xa, belong to R and we have, 
multiplying the equation above by x, 


X= Git "2@ocr ot Fe, 
This shows that a,, a,,..., a, generate a, that is, 
a = (4, a2, w++y Gy) = (@) + (22) +°+> + G,). 


179a. Prove that if a and b are fractionary ideals and b is invertible, then 
Gio abe”. 


179B. Prove that if a and b are invertible fractionary ideals, then a c b if and 
onlyif 6 Yeas? 


179y. Prove that two fractionary ideals are both invertible if and only if 
their product is invertible. 


1796. Let R be a nontrivial integral domain. Show that the invertible 
fractionary ideals of R form a group under the product of ideals. 


179. Suppose that a, b, a+b, and an bare invertible fractionary ideals. 
Show that 


(a+b) '=a!nb"}, 
(anb)-'=a7'4+671. 


1796. Show that the ideal (x, y) of Q[x, y] is not invertible. 


179n. Compute the inverse of the ideal (2, 1 + /=5) in Z(«/ Oe 
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Dedekind Domains 


180. A Dedekind domain is a nontrivial integral domain in which every 
nonzero fractionary ideal is invertible. Every fractionary ideal of a principal 
ideal domain is principal (178) and, if nonzero, invertible (179). Therefore a 
nontrivial principal ideal domain is a Dedekind domain. 

Richard Dedekind (1831-1916) showed that Kummer’s ideal elements, 
which seemed mysterious and artificial, could be viewed as subsets of a ring 
having special properties (closure under addition and closure under multipli- 
cation by ring elements). Dedekind called these sets ideals and showed that 
in many domains they possess a unique factorization theorem. We shall see 
that a nontrivial integral domain has unique factorization for ideals if and 
only if it is a Dedekind domain. 


180. Show that a nontrivial integral domain is a Dedekind domain if and 
only if for each integral ideal a there exists an integral ideal b such that ab 
is principal and ab is nonzero if a is. 


1808. Show that a nontrivial integral domain is a Dedekind domain if and 
only if for any two integral ideals a and b, ac b implies a = be for some 
integral ideal c. 


180y. Prove that in a Dedekind domain an ideal is prime if and only if it is 
not a product of two proper ideals. 


181. Proposition. Every proper prime ideal of a Dedekind domain is a 
maximal ideal. 


Proof. Suppose p is a proper prime ideal of a Dedekind domain R and 
that p c a where a is an integral ideal of R. Then a~'p ca ‘a= R, and there- 
fore a~'p is an integral ideal of R. Now a(a 'p) = p, and therefore either 
ac pora 'pc p (163). Since p < a by hypothesis, a c p implies a = p. On 
the other hand, a 'p c p implies a”! c pp”? = R from which it follows that 
Rca and a= R (179a). We have shown that pc a impliesa=p ora=R. 
Therefore p is maximal. 


Corollary. If aand yp are proper ideals of a Dedekind domain, p is prime, and 
a ¢ p, then ap" =an p" for anyneN. 


Proof. Since a ¢ p and p is maximal, a + p = (1) = R. Therefore we have 
| = p+awhere pe p, aea. Consequently 


1=I"=(p+a)y'=p"+raep" +a, 
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where 


If xean p” then 
x = x(p" + ra) = p"x + rax € ap”, 
which shows that a q p” < ap”. Since ap” can p”, equality follows. 


18la. Showthat for distinct proper prime ideals p,, p2,..., p, of a Dedekind 
domain, 


PPD PR = PL PY OO PK 


182. Theorem. Ina Dedekind domain every proper ideal can be factored as a 
product of proper prime ideals. Furthermore this factorization is unique 
except for the order of the factors. 


Proof. We already know that every proper prime ideal of a Dedekind 
domain is maximal (181) and that every nonzero ideal, being invertible, is 
finitely generated (179). (Since (0) is principal, we can say that every ideal of a 
Dedekind domain is finitely generated. Rings with this property are called 
Noetherian.) 

Let R be a Dedekind domain. If a is a proper ideal of R which is not a 
product of proper prime ideals of R, then a cannot be a prime ideal because 
we construe “‘ product”’ to include products of length one. Therefore a is not 
maximal since maximal ideals are prime (162). Consequently, there is an ideal 
b of Rsuch thata cbc Randa¥b¥ R. Furthermore, ab~! c bb™! = Rso 
that ab~! is an integral ideal. Clearly a factors as (ab~')b. One of the factors 
ab~! or b must fail to be a product of proper primes—otherwise a would be. 
Now a # b and a # ab“! (why?), and we may say: ifa is a proper ideal which 
is not a product of proper primes, then there exists a strictly larger proper ideal 
a’ with the same property. (a’ = b or a’ = ab~’.) 

By iterated use of this statement, we can derive from the existence of a single 
ideal with no proper prime factorization the existence of an infinite, ascending 
chain of ideals of R, 


eaten ee acer () 


in which a“ ¥ a+), (See 183.) We shall see that R does not admit such 
chains. It is routine to verify that the union 


ie] 
W= Va” ={reR|rea”,neN} 
n=0 
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is again an ideal of R. Consequently, QI is finitely generated by elements 


a,,a,,...,a,€R. Foreachi=1,2,..., k there is a natural number n; such 
fierce”. As aresult.a),a,,...,a,¢a™ forn > max{n,, n,,...,7n,}. This 
implies 

2) "awa, 7.4,@).2ae 


for large n, and since a” < Q for all n, we have a“) = QI when n is sufficiently 
large. Of course this contradicts (for large n) the fact that a” 4 a“*”, This 
contradiction shows that we cannot have a proper ideal of R which is not a 
product of proper primes. 

It remains to show uniqueness. We proceed by induction on the number of 
primes in the factorization, that is, we prove by induction the statements S,, 
forneN. 

S,: If ais a proper ideal of R which can be factored as a product of n (or 
fewer) proper prime ideals, then any two factorizations of a are the 
same except possibly for the order of the factors. 

To start the induction, we remark that S, is almost obvious. An ideal which is 
the product of one proper prime ideal is itself a proper prime and has no true 
factorizations at all. Now we assume S, and prove S,,,,. Suppose that 


Q=PyP2 °° Pe = 4192 °°° U 


are two factorizations of the proper ideal a and k <n + 1. Since ac q, and 
q, is prime, one of the factors p,, p,,..., p, must be contained in q, (163). We 
may suppose p, € q,. Since p, is a proper prime and, consequently, maximal, 
we must have p, = q,. Now, however, we see that 


BPp = PiPo Pe) — Ie 1, 


which falls within the scope of S, since k — 1 <n. Therefore these last two 
factorizations are the same, and the given ones must have been the same. 


Corollary. Ina Dedekind domain every proper ideal may be written uniquely in 
the form p\'ps? +++ py, where py, P2, ---, Px are distinct proper prime ideals 
Gd Ven Vo5a.., V, EN. 


182a. Prove that in a Dedekind domain every proper fractionary ideal can 
be written uniquely in the form p}'p3? --: py*, where p,, p,..-., p, are distinct 
prime ideals and v,, v,,..., v, are nonzero integers. 


182. Prove that in a Dedekind domain every proper (integral) ideal can be 
written uniquely in the form p}' nN p32? 0°-: A py, where p,, Po, --., P, are 
distinct prime ideals and v,, v,,..., y.EN. 


1827. Letaand b be fractionary ideals of a Dedekind domain and p,, p,,..., 
p, be prime ideals. Suppose further that 


a=pi'pre- py and b= pl'py?--- pit. 
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Show that a c b if and only if v; > yu; fori=1,2,...,k. (Here the v;s and 
i;’s are allowed to be any integers.) 


1825. Using the notation of the previous problem, show that 
anb= praxis) parCany) o00 pe ives Hay 


ab = pi tis pyztHe ... puerta, 


a+b= prpin(a, 1) pipin(v2. u2) 409 pines Bk) 


182e. Show that for any three fractionary ideals a, b, ¢ of a Dedekind 
domain, 


a+(bonc)=(a+b)n(a+ 0), 
an(b+c)=(anb)+(anc). 


183. The Axiom of Choice. The preceding proof depends upon a set- 
theoretic principle, called the axiom of choice, which has a different character 
and a higher order than the simple rules of intuitive set theory which have 
served us to this point. It is worth a digression to make explicit this principle 
and its use in the present instance, the only place we need it. 

Axiom of Choice. For any set X there exists a mapping 


p:2* +X 


such that 6A € A for each nonempty subset A of X. (Recall that 2* denotes the 
power set of X defined in 14.) The mapping @ is called a choice function for X 
because it ““chooses”’ an element ¢A from each nonempty subset A. 

In the proof of 182 we must apply the axiom of choice to the set ¥p of 
ideals of R in order to construct the ascending chain («). Let #: 278 > Fz bea 
choice function for %,. If a is any ideal without a proper prime factorization, 
let (a) denote the class of ideals which contain a properly and which them- 
selves have no proper prime factorization. The first argument in 182 shows 
that for such a, G(a) is not empty. Therefore the chain (*) is defined inductively 
by the rule a“"*? = ¢(G(a™)), 


184. The Chinese Remainder Theorem. Let a,, a,, ..., 4, be ideals of a 
Dedekind domain R and a,, a,,..., a, elements of R. There exists an element 
x € R, such that 


xX =a;modaq,, a5 Dodo M. (*) 


if and only if a; =a, mod (a; + a;) for all i and j. 
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Proof. The conditions a; =a; mod (a; + a,) are clearly necessary. The 
proof of their sufficiency proceeds by induction on n beginning with n = 2. 
For n = 2, a, = a2 mod (a, + a.) implies that 


a, —a,=a,+a,€a,+ 4), 


where a, €a,, a, €a,. Then x =a, — a = a, + a) has the required proper- 
ties. 

For n > 2, we may suppose, as a result of the induction hypothesis, that 
there is an element x’ € R such that x’ = a; mod a; fori = 1, 2,...,n— 1. Let 


(i SS iy (A Oley (OY SOR (OM 6 


Then x will be a complete solution to the set of m congruences (*) if x = x’ 
mod a and x =a, moda,. There exists such an x € R provided that x’ = x, 
mod (a + a,). However, 


ne 1 reall 
ata,=(()a) +4,=() (a+ 4,), 
i=1 i 


=1 


which follows from 182¢. Since x’ = x; mod a; for i=1, 2,..., n—1, we - 
have 

x’ =x,;=x, mod(a;+a,) fori=1,2,...,2—1. 
This means that fori=1, 2,...,n—1, 


x’— x,€a;+ a, 
and therefore 
x'—x,€a+a,. 
Thus the condition x’ = x, is satisfied and (*) has a solution xe R. 
As an example, this theorem implies that the set of congruences 
x =1 mod 6, x =5 mod 8, x =4mod 9 


has a solution. Indeed any integer x = 13 mod 72 1s a solution. The name of 
this theorem recalls its use (for integers) by the Chinese astronomers of 
ancient times in calender reckoning. 


184a. Let R bea Dedekind domain and a a proper ideal of R with factoriza- 
tion a = p\'p}? --- py*. Prove that R/a is isomorphic to the direct product 


(R/pi') x (R/p3) x +++ x (R/Pi*). 
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184B. Let a,,a,,..., a, be distinct elements of a field F. Using the Chinese 
remainder theorem, show that there exists a polynomial f over F which takes 
given values f(a;) = c; for i= 1, 2,..., n. (Compare 100.) 


184y. Prove that a Dedekind domain with a finite number of prime ideals is a 
principal ideal domain. 


1845. Prove that every ideal of a Dedekind domain can be generated by two 
of its elements, one of which may be chosen arbitrarily. 


184s. Let aand b be ideals of a Dedekind domain. Show that there exists a 
principal ideal (c) such that (c) +ab =a. 


185. In the next articles we shall develop a proof of the converse of the 
unique factorization theorem for ideals in a Dedekind domain (182). It will 
be convenient to call an ideal primigenial if it is a product of proper prime 
ideals. A primigenial ring is one in which every proper ideal is primigenial. 


Proposition. In a nontrivial integral domain factorization of invertible pri- 
migenial ideals is unique. 


Proof. Suppose ais an invertible ideal of a nontrivial integral domain, and 


a= PyP2°°° Pe = 9192 °°° 


are two factorizations of a as a product of proper primes. Each of the p;’s and 
q,’s is invertible (179y); in fact we have 
pr) =a ppp wisn ee 

and a similar formula for q; '. Among the primes p,, pz, ..., P, choose one 
which is minimal in the sense that it does not contain any of the others pro- 
perly. We may assume that this is p,. Since a = q,q, °°: q, iscontained in p, , 
a prime, at least one q; is contained in p, (163). We may suppose q, < p,. 
Similarly, p,p. °°: p, < q, implies p; < q, for some i. However, now we have 
p,q, p,, and hence p; c p,. The choice of p, forces p; = p,, and then 
P, © 4, € p, implies q, = p,. Now we have 


a’ = apy) = aq, * = Pipe °° Pea = 192° Ai-a- 


The ideal a’ is invertible because it is a product of invertible ideals. Therefore 
the entire argument may be repeated with a’ in place of a. The conclusion 
follows in a finite number of steps of this kind. 


186. Proposition. If Ris aprimigenial ring and p is a proper prime ideal of R, 
then the quotient ring R =R/p is a primigenial ring. 
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Proof. Since p is a proper prime ideal, R is a nontrivial integral domain. 
Let ¢: R>R denote the canonical epimorphism, given by ga = a + p=a. Since 
¢ is an epimorphism, it carries ideals of R to ideals of R and preserves pro- 
ducts of ideals, that is, (ab) = (fa)(pb). (We leave the proof to the reader.) 

Furthermore, ¢ gives a one-to-one correspondence between the proper 
prime ideals of R and the proper prime ideals of R which contain p. Indeed, 
if G is a proper prime ideal of R, then ¢~1(G) is a proper prime ideal of R 
containing p, as we have previously shown (166). On the other hand, if q is a 
proper prime ideal of Rand ab € ¢(q), then abe q, where da = a, bb = b. Con- 
sequently, a€ q or be 4q, from which is follows that a € ¢(q) or b € ¢(q), and 
thus $(q) is prime. 

Now suppose that a is a proper ideal of R. Then ¢~‘(a) is a proper ideat of R 
and has a factorization ¢~'(@) = p,p2 °°: P, as a product of proper primes 
each of which contains p. Therefore in R we have the factorization 


a= pp '(4) = (Pi )PP2) *- (PP) 


187. Theorem. A primigenial ring is a Dedekind domain. 


Proof. Let R be a primigenial ring. The essential point is to show that 
every invertible proper prime ideal p of R is maximal. Suppose that p is an 
invertible prime ideal and (0) #p+#(l). Let ae R—p. Then the ideals 
p + (a) and p? + (a) have prime factorizations 


pt+(a)=pip2.°*° Pp, and p? +(a)=qyq2 °° aq). 
Clearly each p, contains p, but further, we have 
PPiP2 “Px = P(P + (a)) = p? + p(a) c p? + (a) cQ,, 


which implies that each q; contains one of the factors of p? + p(a). Each of 
these factors, p, p;, P2, ..-, Px, contains p, and therefore each prime q; con- 
tains p. Passing to the quotient ring R = R/p, which is also primigenial by 186, 
we see that 


Did. °°, =P + (4) =(@) =P" + (@) = 4,42 °° Gh, (*) 


where bars denote images under the canonical epimorphism ¢: R— R. Since 
(a) is principal and therefore invertible, the two factorizations of (a) in (*) 
must be identical except for order (185), and renumbering if necessary, we may 
assume that p, = 9; fori =1, 2,...,/=k. Since all the p,’s and q,’s contain 
p, it follows from}, = G, that p, = q;foreachi. As aresult we have p + (a) = 
p? + (a), from which we conclude that p c p? + (a). Consequently, 


p=pa(p?+(a))o p? + (apc p. 
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Therefore p = p? + (a)p, and since p is invertible, this implies 
R=p'p=p '(p’+@p)=p+@. 


Because a was any element of R — p, this shows that p is maximal. Thus we 
have established the essential point. 

The rest of the proof is easy. Let p be any proper prime ideal of R and 
ae p. Then (a) = p,p2 --: p, © p for primes p,, each of which is invertible and 
consequently maximal. However, one of the p,’s must be contained in p since 
their product is. Now p; ¢ p implies p = p; (since p; is maximal), and we have 
shown that any prime ideal is invertible. Since every proper ideal of R is a 
product of prime ideals, it follows that all such ideals are invertible and R is a 
Dedekind domain. 


Integral Extensions 


188. Let R be a subring of a ring R’. An element x € R’ is said to be integral 
over R if it satisfies an equation of the form 


x + ax" +-++4a,_1x +a, =0 


in which the leading coefficient is 1 and the other coefficients, a,, a,,..., G,; 
are elements of R. The set of elements of R’ which are integral over R is 
called the integral closure of R in R’, and will be denoted R. When R = R, we 
say that R is integrally closedin R’. When R = R’, we say that R’ is an integral 
extension of R. 


189. A ring R’ is a finite extension of a subring R if there exist elements 
Z1, Z2,-.., Z2,€ R’ such that every element ze R’ can be written in the form 
Z=F;2Z; + 1222 + pig: ae C49 


for some elements r,,r,,...,r,€R. In this case the set {2,, Z.,..-, 2,} 1S 
called a basis for R’ over R. 


Proposition. If R’ is a nontrivial integral domain which is a finite extension of a 
subring R, then R’ is an integral extension of R. 
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Proof. Let {z,,2Z2,...,Z,} be a basis for R’ over R. Then for any ze R’ we 
have zz; =) 5) r,jz;, where r;; € R for i, 7=1, 2,..., 1 Thus we may view 
Z,,Z2,--.,Z, aS the solution of the system of linear equations over Qpr. in 
unknowns X,, X2,..., X,: 


(yy — 24, +72 X, +" ~e — 0 
ra1Xy + (ra2 — ZX. ++ + ra, X, = 0 


rai Xy i ln2 - Tei Ulin Ps z)X, = 0. 


Therefore the determinant 


Ny Z 12 ae” lin 
r Inn —Z r 
21 a 2n = 
lar ln2 lan — Z 


However, this determinant is just a polynomial in z with coefficients in R and 
with leading coefficient (— 1)", which shows that z is integral over R. Thus 
every element ze R’ is integral over R, and R’ is an integral extension of R. 


189a. Show that if R’ is a finite extension of Rand R’ is a finite extension of 
R’, that R” is a finite extension of R. 


190. If Ris a subring of R’ and xe R’, we denote by R[x] the smallest sub- 
ring of R’ containing both R and x. Clearly R[x] consists of all the elements of 
R’ which can be written as polynomials in x with coefficients in R. 


Proposition. An element x of a nontrivial integral domain R' is integral over a 
subring R if and only if R[x] is a finite extension of R. 


Proof. By the preceding proposition if R[x] is a finite extension of R, then 
it is an integral extension of R, and consequently x € R[x] is integral over R. 
On the other hand, if x is integral over R and satisfies the equation 


Rotax +s 4a, X+4,=0, yy Gg, ER, 


then every element of R[x] may be written as a polynomial in x of degree less 
than n. (How?) Therefore {1, x, x?,...,x""'} is a basis for R[x] over R. 


191. Proposition. The integral closure R of a subring R of a nontrivial 
integral domain R' is a subring of R’. Furthermore R is integrally closed in R’. 
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Proof. Suppose x,y¢R. Then R[x] is a finite extension of R, and 
R[x, y] = (R[x])[)] is a finite extension of R[x] because x 1s integral over R and 
y is integral over R[x]. It follows that R[x, y] is a finite extension of R. (See 
189.) In fact, if x and y satisfy equations of degree n and m respectively over 
R, then the set of nm elements x‘y/ where 0 < i < nand0 <j < misa basis for 
R[x, y] over R. Since R[x, y] is a finite extension of R, it is an integral exten- 
sion (189); consequently, the elements x — y and xy of R[x, y] are integral 
over R. This shows that R is a ring. 

Suppose now that x is an element of R’ and that x is integral over R. Then 
x satisfies an equation 


M+ Gx 4+ 44-8 +4 =0, » Gy dou. 0 pee 
It follows that we have a sequence of finite extensions 
R c R{a,] = Rid,, a>] (ecm CT ecm Ria,, a2, oeeg an] (‘ss R{a,, a, erevels a; x]. 


Therefore R[@,,4,,...,4,, x] is a finite extension, and consequently, an 
integral extension of R. Thus x is integral over R, that is, x € R. 


Algebraic Integers 


192. We recall that an algebraic number is a complex number which is 
algebraic over the rational field Q and that the set of all such numbers forms 
a field Q (108). Analogously, an algebraic integer is a complex number which 
is integral over the ring of integers Z. By the preceding proposition we know 
that the set of algebraic integers forms a ring Z. Of course Z is just the integral 
closure of Z considered as a subring of C. Furthermore, since every element of 
Z satisfies an algebraic equation over Z, it is clear that an algebraic integer is 
an algebraic number, that is, Zc Q. 

In general the algebraic integers which belong to a given number field F are 
called the integers of F and form a ring which we denote Z,;. Obviously 
Zp = ZF. By definition we have Zc = Z. 


Proposition. The ring of integers Z is integrally closed in the rational field Q, 
that is to say, Ze = Z. 


Proof. Suppose that r/seQ is an algebraic integer where r,seZ and 
(7,8) =A1. Then for somea), a3. ses, 6 Z, 
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()+a()" + +au(t) +a=0 (1) 


n-1 


and 

PP ar sh tea, iets” +48" = 0. (2) 
Equation (2) shows that s|r”. Since (r,s) = 1, we must have s= +1 and 
r/s = +reZ. This completes the proof. 


The elements of Z are often called rational integers to emphasize that they 
are the integers of the rational field Q. 


Proposition. For every algebraic number « there exists a rational integer m 
such that ma is an algebraic integer. 


Proof. Wemay assume without loss of generality that « is a root of a 
primitive polynomial f given by 


Tea ax + + a,x + a,, Bis Qs oss, Qe 
Then ay is a root of the polynomial g given by 
Pee x ta ++: + 4,210) °"X +a, °- 
Since g has integral coefficients, aya is an algebraic integer, and we may take 
m=. 


192a. Let R be a number domain (subring of C) which is a principal ideal 
domain. Show that R is integrally closed in its field of fractions Qp. 


192B. Show that the ring of integers of the field Q(/) is Z(i), the ring of 
Gaussian integers. 


1927. Show that the polynomial ring Z[x] is integrally closed in its field of 
fractions. 


1926. Let « be an algebraic integer of E, a Galois extension of Q with group 
G(E/Q) = {¢;, 62, .-.-, >}. Prove thatall the conjugates ga, 2 a,..., 6,0 of 
a are algebraic integers and that the polynomial 


(x — bya)(x — b2 4) °* (X — On) 


has coefficients which are rational integers. 


193. Proposition. If f is a polynomial over Z and « is a root of f, then 
(fx)/(x — a) is also a polynomial over Z. 
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Proof. The proof is by induction on the degree of f, When deg f= 1, we 
have fx = a(x — «) where ae Z and the result is obvious. To accomplish the 
induction step, we suppose the statement holds for polynomials of degree less 
than n. Suppose deg f= n and 


SX =A xX" + a,x" +++) 4+4,.x+4,. 
It follows from the argument of the preceding article that: 


(1) a) € Z, 

(2) the polynomial g defined by fx = a) x" '(x — a) + gx has coefficients 
in Z, 

(3) deg g < n. (Why?) 


Therefore, by the induction hypothesis (gx)/(x — «) has coefficients in Z, 
and consequently so does f since 


Corollary. If %, %,...,% are roots of the polynomial f over Z given by 
fR=QoX' +.4,3""' $e oe ae 


then Ay &,%2 *** &, is an algebraic integer. 


Proof. Let 41, &j+2,.--+,) %, denote the remaining roots of f. Applying 
the proposition n — k times shows that the polynomial 
f(x) 


= 6 Se 

(x = py Ue = 24-3) — @,) : 

has coefficients in Z. However, dy a, «** a, is the constant term of this poly- 
nomial. 


194. In this and the following two articles we shall consider the situation 
where R is the ring of algebraic integers of a number field E which is a Galois 
extension of the rational field Q. To fix notation, we assume that [E: Q] =n 
and that the Galois group is #E/Q) = {¢,, 62, ..., da}. We shall omit 
repetition of these assumptions in the hypotheses of the propositions. 


Proposition. There exists a basis w,, @,,..., @, for E over Q such that every 
element « € R can be written uniquely in the form 


a= a,0, + a,@,+°''+4,0,, 


where a,,@,,...,4,€ Z. 
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Proof. Since every algebraic number can be multiplied by an integer to 
obtain an algebraic integer, we may begin with the assumption that we have 
a basis @,, @,,..., @, for E over Q consisting of algebraic integers. Now 
we form the determinant 


Pi®, $10, *** $6, 
-_ 2 O $2 O2 = $2 Oy 


$,0, nD ae nD, 


We observe that 6 is an algebraic integer, or more specifically, that 6 € R since 
each of the elements ¢; @, belongs to R. (1925). Furthermore, for ¢ € @(E/Q) 
we have 


9,0, 66,0, +": $10, 
$5 = $2 $2 ®2 met $$2 O, 


$bn0, $¢,0, ° $,0, 


However, the effect of letting ¢ act this way is simply to permute the rows. 
Consequently, 66 = +6 for any ¢ € G(E/Q). Therefore 5? € Q. Since 6? is an 
algebraic integer, it follows that 67 eZ. Now we set w; = @,/6*. Clearly 
™,,@2,.--, @, form a basis for E over Q. 

Suppose now that « € R and that we have written « as 


where @,,4,,...,4,€ Q. Applying ¢,, $2, ..., ¢, to a, we obtain: 


$i% = 4,($,@,) + 42(6,02) + +++ + 4,(G,0,) 
2 Cae a(e2 @,) + a(b2 5) He are Al2 @,) (+) 


; a= a(t, @,) + ald, @2) poe aid, @,)- 
We may therefore interpret @,,@,,...,d, as the solution of the system (*) of 


linear equations over E. Cofieoquently a; = 6,/6, where 6 is the determinant 
above and where 


(We 2) se Was 
$1, GO, **' Gye +++ G1, 
— $2 $2 D2 es 2 tee 620, 


$n Dy $, Dy mr b, a oe Pn 3, 
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is obtained from 6 by modifying the i-th column. Clearly 6; is an algebraic 
integer. Furthermore 67a; = 66; € RA Q = Z. Now we set a; = 674; € Z, and 
we have 


4=4,0, + 4,0,+°:'+4,0, 


- - @2 = @ 
= 08, 5 + PAG + OOS 


= 4,@, + a, @, ae oh o Sadi @).. 4 


Since @,, @2,...,@, form a basis for E over Q, this final expression for a 
must be unique. 


194. Prove the proposition above assuming only that £ is a finite extension 


of Q. 


195. Proposition. Ifa is an ideal of R, then there exist elements «,, &,..., 
“, € a such that every element a € a may be written in the form 


X= C0, +024, +°°'+C,4,, 


where C1, C2, -.+5 GEL. 


Proof. We define mappings f,, /2,...,/, fromato Z by f;a =a;, where 
a4 =a,o, + a,W,+°*':+a,@, is the unique expression of « in terms of a 
basis w,, @2,...,@,, chosen as guaranteed by the preceding article. The f; 
are not ring homomorphisms in general, but they are homomorphisms of the 
additive group structure. Therefore, the sets 


A, =Imf, =f), 
A, =f,(Ker f;), 


A, =f.(Ket fp Kerf, a "ener, =) 


are subgroups of Z. Therefore there are integers k,,k,,...,k, such that 
A; =k; Z. It follows that we can choose elements «,, #,..., %, € a such that 
fia; =k; and fja; =0 for j <i. 

Given ae a, we have f;a = c,k, for some c, € Z. Then f,(@ — c,a,) = 0 and 
F(a — c,%,) = c,k, for some c, € Z. Then 


S(@ — €4%, — C2 %2) = fo(a — C44, — C2 a2) = 0. 
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Continuing in this manner, we obtain c,, c,,..., c, € Z such that 
Sa — C40, — C24, —*** —c,4,) = 0, [12 esti, 
which implies that « = c,a, + ¢c,4, +:''+,4,. 


Corollary. The ring R is Noetherian, that is, every ideal of R is finitely 
generated. 


Proof. In the notation of the proposition we have that a is generated by 
Ay, Aa, --., Qy- 


195a. Prove that an integral domain R’ which is a finite extension of a 
Noetherian domain is itself Noetherian. 


1958. Prove that a ring is Noetherian if it is a quotient ring of a Noetherian 
ring. 


195y. With R as above and n € Z show that R/(n) is finite. 


196. Theorem. The ring of algebraic integers in a Galois extension of the 
rational field Q is a Dedekind domain. 


Proof. As above we let E be a Galois extension of Q of degree n with 
G(E/Q) = {¢,, 2, ---, >} and R= Z, = Zo E. It is enough to show that 
for any nonzero ideal a of R there exists a nonzero ideal b of R such that 
ab = (c), a principal ideal; this implies that a(b/(c)) = R, and a is invertible. 

Suppose then that a is a nonzero ideal of R. From 195 we know that a is 
finitely generated, say a = (a, a, ..., &). We form the polynomial 


fe = Og X* + yx tee + yi x + H 
and let p,, p2,---, Py € C denote the roots of f, Then it follows that 
a; =(—1)'o%9 (p15 Pr, +--+ Pus 


where g; denotes the i-th symmetric function (131). For ¢ € G(E/Q) we let of 
denote the polynomial given by 


(¢f)x = (pao)x* a (pa, )x*-! + °° + (Go, —1)% + (Ge,). 


The coefficients of @f are elements of R, that is, algebraic integers in FE, and 
the roots of df are op;, dp2,..., dp,. It follows that 


h= (if G2f) nf) 
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is a polynomial whose coefficients are algebraic integers of E and, at the same 
time, rational numbers since ¢h=h for ¢e G(E/Q). Consequently the 
coefficients of A are rational integers. Furthermore, one element of GY £/Q) is 
the identity automorphism of £, and therefore fis one of the factors of h. In 
other words, h = fg where g is a polynomial with coefficients in R. We let 


gx = gee + be ae ae 
and 
hx = Yox” ap ‘sae Slee kat esneosr Vm> 


where m = k + /. We denote the roots of g by 1,, T2,..., 7,3 then the roots of 
hare 


Pir P25+++5 Pho T1s T25+++5 Ty. 


Since h has integral coefficients, we can write h = ch where h is a primitive 
polynomial and c is the content of h, which is to say, the greatest common 
divisor of the integers yo, 71, --->Ym- Lhe coefficients of h are 


(Yo/e), (y:/c), I (YmlC), 


all of which are integers, while the roots of h are the same as those of h. 
Therefore we know from the corollary in 193 that 


Col) Psi Aa”?! Ose TaN rs we Wee (*) 
is an algebraic integer for any choice of integers, 
los, <s, =" "= 5 =k, Dry < <A 
As a result, & Bo = Yo implies 
a; B; = ((— 1)'a F(P1, P2>+++s Pr)IM(—1)/Bo Ot, 352s eeep 


= = Neg Bo oi(p1, Prseres px)o (7, > T,) 
a (—1)'*/e{(yo/e)o i(p,, P2 OO) pP,)O (T,, T2 Smusienens) T,)}. 


In the last equation the term inside the braces is the sum of all the terms of the 

form (*); therefore it is an algebraic integer and an element of R. The result is 

that «; 8; €(c), the principal ideal of R generated by the rational integer c. 

(Of course this holds for any choice of iandj,0 <i<k andO<j</.) 
Now the coefficients of the polynomial g generate an ideal 


b = (Bo, By, .--, Bi) 


of R, and the coefficients of the polynomial h generate the principal ideal 
(c) = (Yo: V1) +++» Ym): The product ideal ab is generated by all the elements 
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a;B;, each of which belongs to (c), and consequently ab c (c). On the other 
hand, 


a = Xo B; ate 0,32; geo Tes 0;-1B; oT a; Bo 


(if we take a; = 0 for j > k and B; = 0 for j > /). Thus the generators of (c) all 
belong to ab and (c) < ab. We may finally conclude that ab = (c), which com- 
pletes the proof. 


196a. Prove that for any nonzero ideal a of R, R/a is finite. 


196B. Prove that the ring of integers Z, in any finite extension F of Q is a 
Dedekind domain. (This result is sometimes called the fundamental theorem of 
algebraic number theory. The theorem above is a special case from which the 
general case follows.) 


197. Normand Trace. Let the number field E be a Galois extension of 
Q with the group ¥(E/Q) = {¢,, d2,...,¢,}. To any element ae€E we 
assign two rational numbers, Na and Ta, called respectively the norm and 
trace of a. These are defined by 


Na = (a)(G2 0) *** (Pn) 


and 


Ta = (p14) + (624) + °°: + (G0). 
We make the obvious observations about norm and trace: 


(1) Norm is multiplicative, that is, N(«#B) = (Na)(NB). 
(2) Trace is additive, that is, T(a + B) = (Ta) + (TB). 
(3) Ifa is an algebraic integer, then Na and Tx are rational integers. 


197a. Show that when [E: Q] = 2, an element ae E is an algebraic integer 
if and only if Na and Ta are rational integers. 


197B. Determine the integers of Q(./—5). 


198. Theorem. For p prime and p=e?""?, the Kummer ring Z(p) is a 
Dedekind domain. 


Proof. In view of the preceding theorem (196), we need to establish only 
that Z(p) is the ring of integers of Q(p), which is the splitting field of x? — 1 
over Q and consequently a Galois extension of Q. 
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We recall from 134 that the Galois group ¥(Q(p)/Q) is isomorphic to Z, 
and that it consists of the p — 1 automorphisms ¢,, ¢,,..., 6,-, determined 
by ¢, p = p*. We also recall that the irreducible monic polynomial for p over 


Q is the cyclotomic polynomial ®, given by 
Dix = xP $ xP He + X41 = (x — px — p?) +++ (x — p??). (*) 


If we take x = 1 in (*), we get ®,(1) = p = N(1 — p). This shows among other 
things that 1 — p is prime in Zq,), the ring of algebraic integers of Q(p). 

Next we remark that (1 — p) 1 Z=(p) in Zq,) or, in other words, a 
rational integer divisible by (1 — p) is divisible by p. This is easy to see since 
(1 — p) \ Zis a proper ideal of Z and contains (p), which is a maximal ideal. 

Let a be an algebraic integer of Q(p), that is, an element of Zap). Since the 
numbers p, p”, ..., p?~' forma basis for Q(p) over Q, wecan write « uniquely 
in the form 


a= a,p a5 a, p? te a Gaga 
where @;, a, ..., @,-1€Q. We will have ae Z(p) if we can show that 


@,,@,...,@,-, € Z. First we remark that 


ky k pore ae (p-1)k — wm [p> af pbk, 
14+ 7(p=1+ p'+p*+-:-+ Oo) =15 i¢ oe 


Therefore 7(1)=p—1 and 7(p*)= —1 for k=1,2,...,p—1. Next we 
compute the trace of (1 — p)p~‘« for i= 1, 2,...,p — 1. We have 


=i) 


TIA — p)p'al = 710 = po" ap") 


j=1 
pal nee 
= ) aTll— per) 
p-1 Mi A 
= es ali’ est 
pa, if i=1, 


~ | pla, —ajz3) 1 ere 


However, (1 — p)p~! is an algebraic integer divisible by 1 — p, and conse- 
quently its trace is a rational integer divisible by 1 — p (why?) and, hence, 
also divisible by p. As a result all the numbers a,, a, — a, ..., @p-1 — Gp-2 
are rational integers, from which it follows that « € Z(p). Thus we have shown 
that Z(p) > Za). Therefore Z(p) = Zap) and we are finished. 


198. Show that in Z(p) the ideal (p) has the factorization (1 — p)?7!. 
198B. Let g bea rational prime. When is (q) a prime ideal of Z(p)? 


Bibliography 


General References 


Artin, Emil, Galois Theory, second edition, Notre Dame Mathematical 
Lectures, No. 2. 


Birkhoff, G., and S. MacLane, A Survey of Modern Algebra, revised edition. 
New York: The Macmillan Company, 1953. 


Eves, Howard, A Survey of Geometry, vol. I. Boston: Allyn and Bacon, Inc., 
1963. 


Hall, Marshall, Theory of Groups. New York: The Macmillan Company, 
1959. 


Hardy, G. H., and E. M. Wright, An Introduction to the Theory of Numbers, 
fourth edition. Oxford: The Clarendon Press, 1960. 


Postnikov, M. M., Fundamentals of Galois Theory. Groningen: P. Noordhoff, 
Ltd., 1962. 


van der Waerden, B. L., Modern Algebra. New York: F. Ungar Publishing 
Company, 1949. 


Zariski, O., and P. Samuel, Commutative Algebra, vol. |. Princeton: D. Van 
Nostrand Company, 1958. 


197 


198 Bibliography 


Historical References 


al-Khwarizmi, Robert of Chester’s Latin Translation of the Algebra of 
Muhammed ben Musa. New York: The Macmillan Company, 1915. 
Contains an English translation of the Latin version. 


Burkhardt, H., “Endliche Diskrete Gruppen,” Encyclopddie der Mathe- 
matischen Wissenschaften, Band I, Teil I, Heft 3. Leipzig, 1899. A 
survey of the history of the theory of finite groups up to 1899. (In 
German.) 


Cayley, Arthur, ‘On the theory of groups as depending on the symbolical 
equation 6" = 1,” Collected Works, vol. II, pp. 123-132. Cambridge: 
The University Press, 1889-97. Two short, easy articles in which groups 
are discussed abstractly. 


Cardano, Girolamo, The Great Art or the Rules of Algebra. Cambridge, 
Massachusetts: The M.I.T. Press, 1968. 


Dedekind, Richard, Sur la Théorie des Nombres Entiers Algébriques. Paris, 
1877. A beautiful little introduction to algebraic integers and ideal theory. 
(In French.) 


Euler, Leonard, An Introduction to the Elements of Algebra, fourth edition. 
Boston: Hilliard, Gray, and Company, 1836. A classic elementary 
textbook. 


Galois, Evariste, Ecrits et mémoirs mathématiques. Paris: Gauthier-Villars, 
1962. The definitive edition of the complete works of Galois. (In French.) 

Gauss, Karl Friedrich, Disquisitiones Arithmeticae. New Haven: Yale Univer- 
sity Press, 1966. (In English.) 


Lagrange, Joseph Louis, ‘‘ Réflexions sur la Résolution Algébrique des 
Equations,” Oeuvres de Lagrange, vol. 3, pp. 205-421. Paris: Gauthier- 
Villars, 1869. (In French.) 


Ruffini, Paolo, Teoria generale della equazioni in ciu si dimostra impossible la 
soluzione algebraica della equazioni generali di grado superiore al quarto. 
Bologna, 1799. (In Italian.) 


Waring, Edward, Meditationes Algebraicae. Cantabrigiae, 1770. (In Latin.) 


References are to articles. 


Abel, Niels Henrik, 29, 55, 139, 175 


abelian group, 29 
algebraic: 
closure, 1125 
element, 108 
extension, 108 
integer, 192 
number, 108 
number, norm of, 197 
alternating group, 81 
simplicity of, 83 
Ankeny, Nesmith, C., 101 


associated ring elements, 169 


associative law, 275 
automorphism: 

field, 122 

group, 64 

inner, 64 

ring, 165 

outer, 64 
axiom of choice, 183 


Bachet, 175 


Index 


basis of a ring extension, 189 


basis of a vector space, 93 
Bernoulli, Nicholas, 78a 
binomial theorem, 20y 


Cardan’s formula, 147 
cartesian product, 9 


Cauchy, Augustin Louis, 55, 78, 175 


Cauchy’s theorem, 55 
Cayley, Arthur, 77 
Cayley’s theorem, 77a 
center of a group, 358, 50 
characteristic: 

of a field, 89a 

of a ring, 1658 
characteristic subgroup, 64e 
choice axiom, 183 
class equation, 51 


199 


200 Index 


closure: divisor, greatest common, 23 
algebraic, 1126 for polynomials, 103 
integral, 188 for ring elements, 170 
separable, 113 divisor of zero, 152 

commutative group, 29 domain of a mapping, 10 

commutative ring with unity, 150 domain: 

complement of a subset, 88 Dedekind, 182 

composite mapping, 16 euclidean, 159 

composition series, 52, 73 integral, 152 

congruence: principal ideal, 158 


of integers, 18 

modulo subgroup, 37 
conjugacy class equation, 51 
conjugate: 

elements in a group, 45 

elements in a field, 130y 

subfields, 130y 
constant polynomial, 98 
content of a polynomial, 106 


effective transformation group, 53y, 76 
Eisenstein, F. G. M., 107 
Eisenstein irreducibility criterion, 107 
element: 

algebraic, 108 

integral, 188 


correspondence (one-to-one), 13 wi oe poe 
coset: proper, 151 
left, 37 separable, 113¢ 
right, 378 elementary symmetric functions, 131 
countable set, 15 empty set, 3 
cubic equations, 147 endomorphism: 


eyclem/9> 

cyclic group, 43 

cyclic permutation, 79 
cyclotomic polynomials, 134 


of groups, 60 

of rings, 165 
epimorphism: 

of groups, 67 

of rings, 165 
equation(s): 

conjugacy class, 51 


D’Alembert, Jean-le-rond, 101 cubic, 147 
Dedekind domain, 182 quadratic, 146 
Dedekind, Richard, 175 quartic, 148 
degree of a field extension, 96 quintic, 149 
degree of a polynomial, 98 solvable in radicals, 139 
derivative of a polynomial, 100a equivalence class, 17 
difference of sets, 8 equivalence relation, 17 
dihedral groups, 268 euclidean algorithm, 23¢ 
dimension of a vector space, 95 Euler, Leonard, 25, 42, 136, 175 
Diophantus of Alexandria, 175 Euler’s ¢-function, 25 
direct product: Euler’s theorem, 42 

of groups, 26¢ even permutation, 81 

of rings, 152n extension field, 96 
Dirichlet, Gustave Lejeune, 175 algebraic, 108 
discriminant: degree of, 96 

of a cubic, 147 finite, 96 

of a quartic, 148 Galois, 127 
disjoint sets, 5 normal, 129 
distributive laws, 7 radical, 143 
division theorem, 21 separable, 113e 


for polynomials, 99 simple, 114 


extension ring: 
finite, 189 
integral, 188 


factorization: 
improper, 169 
proper, 169 
Fermat, Pierre de, 135, 175 
Fermat’s last theorem, 175 
Fermat’s little theorem, 42 
Fermat’s numbers, 135 
field: 87 
automorphism of, 122 
characteristic of, 89a 
extension, 96 
fixed, 123 
ground, 96 
homororphism of, 877 
of fractions, 176 
perfect, 113y 
prime, 89 
splitting 110y, 132 
finite: 
dimensional vector space, 92 
extension field, 96 
extension ring, 189 
group, 38 
set, 15 
fixed field, 123 
formula, Cardan’s, 147 
formula, LaGrange interpolation, 100y 
four-group, 261 
fractionary ideals, 177 
principal, 177 
quotient of, 178 
fractions, field of, 176 
Frobenius, Georg, 59 
function: 
choice, 183 
elementary symmetric, 131 
Euler’s ¢, 25 
integral, 98a 
Mobius, 258, 1346 
rational, 98a, 176 
totient, 25 
fundamental theorem: 
of arithmetic, 24 
of algebra, 101 
of algebraic number theory, 196 
of Galois theory, 130 
of symmetric polynomials, 131 


Index 201 


Galois, Evariste, 139 
Galois extension, 127 
Galois group, 127 
Galois’s theorem, 145 
Galois tower, 142 
Gauss, Karl Friedrich, 101, 136, 175 
Gaussian integers, 154, 174 
Gaussian primes, 174 
Gauss’s lemma, 106 
greatest common divisor, 23 
for polynomials, 103 
for ring elements, 170 
ground field, 96 
group, 26 
abelian, 29 
alternating, 81 
automorphism of, 64 
center of, 355, 50 
commutative, 29 
conjugacy class equation, 51 
cyclic, 43 
dihedral, 268 
endomorphism of, 60 
epimorphism of, 67 
finite, 38 
Galois, 127 
Hamiltonian, 46+ 
infinite, 38 
of field automorphisms, 122 
of permutations, 76 
of order p”, 52 
of outer automorphisms, 64« 
of units in a ring, 151 
opposite, 268 
order of, 38 
prime power, 52 
quaternion, 46 
quotient, 47 
simple, 59e 
solvable, 75 
symmetric, 30, 76 
transformation, 53 
transitive, 86 
trivial, 26 
group product, 26 


Hamiltonian group, 46¢ 
heptadecagon (regular), 138 
Hermes, 136 

Hilbert, David, 175 


202 Index 


homomorphism: 
field, 87 
group, 60 
kernel of, 65 
Ting, 165 


ideals, 157 
fractionary, 177 
integral, 177 
invertible, 179 
maximal, 162 
prime, 163 
primigenial, 185 
principal, 158 
product of, 161 
proper, 157 
quotient of, 161e 
relatively prime, 167¢ 
sum of, 160 
identity element (of a group), 27 
image of a mapping, 11 
of a homomorphism, 66 
image, inverse, 12 
improper factorization, 169 
index of a subgroup, 39 
indicator, 25 
induction, 20 
infinite group, 38 
infinite set, 15 
inner automorphism, 64 
integer: 
algebraic, 192 
Gaussian, 154, 174 
of a number field, 192 
rational, 192 
integral: 
closure, 188 
domain, 152 
element, 188 
extension, 188 
function, 98a 
ideal, 177 
integrally closed subring, 188 
intersection of sets, 5 
invariant subgroup, 46, 64 
inverse image, 12 
invertible ideal, 179 
irreducibility criterion: 
Eisenstein, 107 
Netto, 1078 
isomorphism: 
of groups,, 287, 62 
of rings, 165 


isomorphism theorems: 
first, 68 
second, 69 
third (Zassenhaus), 70 


kernel of a homomorphism, 65, 165@ 
Klein, Felix, 26 

Kummer, Ernst Eduard, 155, 175 
Kummer rings, 155, 198 

Kronecker, Leopold, 103%, 175 
Kronecker’s theorem, 103¢ 


Lagrange, Joseph Louis, 40, 100n 
Lagrange interpolation formula, 100y 
Lagrange resolvent, 144 
Lagrange’s theorem, 40 
Lamé, Gabriel, 175 
LaPlace, Pierre Simon de, 175 
leading coefficient, 98 
least common multiple, 23y, 170 
left coset, 37 
Legendre, Adrien Marie, 175 
length of a group, 738 
linear dependence, 91 

of automorphisms, 124 
Liouville, Joseph, 175 


mapping, 10 

composite, 16 

domain of, 10 

one-to-one, 12 

onto, 11 

range of, 10 
McKay, James, 55 
maximal ideal, 162 
minimal polynomial, 109 
M6Obius function, 258, 1346 
monic polynomial, 99a 
monomorphism: 

of groups, 65 

of rings, 165 
multiple root, 1008, 101 
multiplicity of a root, 1018 


natural numbers, 15 

Netto, Eugen, 1076 
irreducibility criterion, 1076 

Newton’s identities, 1318 


Noetherian ring, 182 
noncommutative ring, 1508 
norm of an algebraic number, 197 
normal extension field, 129a 
normalizer, 48 
normal subgroup, 46 
null ring, 151 
number: 

algebraic, 108 

Fermat, 135 

natural, 15 

transcendental, 108 
number domain, 152 
number field, 88 


odd permutation, 81 
One-to-one correspondence, 13 
onto mapping, 11 

Opposite group, 266 

orbit, 54 

order of a group, 38 

outer automorphism, 64 


pentagon, regular, 137 
perfect field, 113y 
permutation, 30, 76 
cyclic, 79 
even, 81 
odd, 81 
permutation group, 76 
phi-function (Euler), 25 
polygons, regular, 135 
polynomial, 98 
constant, 98 
content of, 106 
cyclotomic, 134 
degree of, 98 
derivative of, 100a 
minimal, 109 
monic, 99a 
primitive, 105 
Tings, 156 
separable, 113a 
symmetric, 131 
power set, 14 
prime, 22 
Gaussian, 174 
element of a ring, 169 
field, 89 
ideal, 162 
power group, 52 
proper, 169 
relatively, 23, 167¢ 


Index 


primigenial ideal, 185 
primigenial ring, 185 
primitive element, 114a 
primitive polynomial, 105 
principal ideal, 158 
domain, 158 
fractionary, 177 
product: 
cartesian, 9 
direct, 260, 1527 
of ideals, 161 
of sets, 9 
proper: 
element of a ring, 151 
factorization, 169 
ideal, 157 
prime, 169 
subgroup, 35 
subset, 3 


quadratic equations, 146 
quartic equations, 148 
quaternion group, 46 
quintic equations, 55, 149 
quotient: 

group, 47 

of fractionary ideals, 178 

of ideals, 16le 

set, 17 


radical extension, 143 
radical tower, 142 
range of a mapping, 10 
rational function, 98a, 176 
rational integers, 192 
regular heptadecagon, 138 
regular pentagon, 137 
regular polygons, 135 
relation on sets, 10a 
relatively prime, 23 
ideals, 1672 
remainder theorem, 99 
repeated root, 101 
resolvent, LaGrange, 144 
Richelot, 136 
right coset, 378 
ring, 150 
automorphism of, 165 
characteristic of, 1658 
commutative with unity, 150 
endomorphism of, 165 
epimorphism of, 165 


203 


Zee A 21. N he a we > 


CY See ste} 


~ @BPswu nm 


SS = & 


The Greek Alphabet 


alpha 
beta 
gamma 
delta 
epsilon 
zeta 
eta 
theta 
iota 
kappa 
lambda 


mu 
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Symbols for Special Sets 


natural numbers 
{1525 «see, 
integers 

integers modulo n 
units of Z,, 
Gaussian integers 
rational numbers 
real numbers 


complex numbers 


QA Bo Aa Oo Ue = 


eS = = fe 


nu 
XE 
omicron 
pi 

rho 
sigma 
tau 
upsilon 
phi 

chi 

psi 
omega 
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